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Message from the Chairman 

D Fouadar. PES 



I am pleased to convey my warm greetings to all the participants of the International Conference 
RETCOMP '2013 organized by Department of CS & E, PESIT South Campus. 

I congratulate department of CS & E, PESIT South Campus for hosting its First "International 
Conference on Recent Trends in Computer and computational sciences (RETCOMP)" during Jan 11-13, 
2013. 

I learn that many Technocrats, educationalists and academic personnel from National level and 
International level are involved in technical/science education congregate and exchange their experiences 
in the respective domain. This conference brings together academic luminaries and I am delighted such a 
significant event is happening in the precincts of PES Institutions. 

I wish the organizing committee Department of CS & E the very best in their endeavor. I eagerly 
look forward to meet all the delegates and speakers from various parts of the country and world during 
this conference. 

Thank you and wish the conference a grand success. 



Dr. M R Doreswamy, MLC 

Founder Chairman-PES Group of Institutions, 

Bengaluru 
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Message from the CEO 
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I am happy that Department of CS & E, PESIT South Campus is hosting the 1 st 
International Conference on Recent Trends in Computer and computational sciences 
(RETCOMP) during Jan 11-13, 2013. I take this opportunity to extend my greetings and best 
wishes to all participants. 

The conference aims to stress upon importance of mathematics and physical science in 
the field of Computer Science and Engineering and hopes to create a platform where eminent 
researchers from various relevant background interact and collaborate. 

It is learnt that more than Sixty five papers have been received from all over the world. 
Approximately 20% of the papers submitted have been accepted for presentation and publication 
in the proceedings. 

All the staff members of CS & E department have been working hard for months together 
for the success of this conference. I sincerely convey my heartfelt congratulations to all 
members, both teaching as well as non-teaching who have put efforts for this event. 

I am sure the deliberations of the conference comprising of special panel discussions, 
technical paper presentations and key-note addresses will bring out lot of value to effect desired 
changes in learning experience. 

Warm wishes, 

Prof. D. Jawahar 

CEO, PES Group of Institutions, Bengaluru 
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Message from the COO 



It gives me immense pleasure and pride to be a part of the First International Conference 
"RETCOMP-2013" at PES Institute of Technology, Bengaluru. 

The conference will bring together leading researchers, engineers and scientists in the 
domain of Computer networking, Data mining, Image Processing with a special session on 
interface opportunities between Mathematical, Physical and Computer sciences. 

Speakers have assembled from different parts of the country and abroad and I take this 
opportunity to thank them for taking time out of their busy schedules for the conference. 

The efforts of the faculty and staff members of the Dept. of CS & E are commendable 
and I'm sure that the outcome of the event will be commensurate with the time spent in 
organizing the conference. 



Prof. Ajoy Kumar 

COO, PES Group of Institutions, Bengaluru 
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. , . . Dr. J. Suryaprasad 

Message from the Finance Chair Director, pesse 



Research is considered as an integral part of academics. It not only enhances the learning, but also 
provides opportunities to solve many challenging issues that humans encounter. Critical today is cross- 
disciplinary research which needs specialists with multiple domain knowledge get to the crux of 
challenging issues and explore/propose suitable solutions. 

RETCOMP is one such opportunity which is a confluence of scientists and technologists. Though 
focused in terms of the domain, RETCOMP is a platform for explaining & showcasing the influence of 
sciences more so computational sciences, on technology domain including computers and its allied 
streams. Need of the hour are such platforms which helps the ecosystem by sharing interesting research 
findings in a fast technologically changing world. 

We sincerely wish and hope that RETCOMP would open many meaningful opportunities to 
share, collaborate and explore challenging domains that spreads across multiple disciplines in the years to 
come. 



With Best Wishes 

Dr.Suryaprasad J 
Director/Principal 
PES Institute of Technology, Bangalore South Campus 
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Message from the Organizing Chair 



It is my proud privilege to say that the department of computer science and engineering is 
organizing the first international conference on Recent Trends in computer and computational 
Sciences. The conference theme is focused on the recent trends in both technology and science 
and also provides an opportunity to students to reflect their young minds. I am very glad to say 
that professors and scientists from reputed universities from Russia and India, IIT's and ISI 
Kolkata are sharing their expertise with the participants. The conference provides a forum where 
the latest developments in the area of Pattern Recognition, Digital Image Processing, Mobile 
Computing, Networks and computational sciences will be explored by many enthusiastic 
researchers. I congratulate everyone who have contributed to this conference with their papers. I 
thank the reviewers, speakers and the session chairs for accepting our requests to be the part of 
conference and also for their valuable time and efforts. I thank the Management and Director for 
providing us an wonderful opportunity to bring the stalwarts in the area of Computer and 
Computational Science to our campus. I congratulate Dr.Snehanshu Saha and the faculty 
members for sorting through all the myriad challenges of putting together this fine conference. I 
hope the conference will be valuable. 

With Best Wishes 

Dr Srikanta Murthy K 

Organizing Chair, 

HOD, Dept of CSE, 

PES Institute of Technology, Bangalore South Campus 
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Message from the Program Chair 



As Science reflects the business world increasingly the prospect of investing in Computational 
Science is expected to go up several notches. The use of advanced computing algorithms to facilitate 
complex phenomena in physical, chemical, biological, and materials systems is gaining momentum. 
Consequently, the field of computational sciences intends to go deeper towards understanding large 
quantities of data. As some cultures change some trends in traditional computer science continues to hold 
sway and the barriers between the defined domains of computer science and computational science 
continue to exist. 

This is where I believe academic institutions have significant role to play. Science needs "state- 
of-the-art" computing machines to be embedded in the research methodology. The community of 
Computer Engineers is constantly in search of solving intriguing and exciting problems with enhanced 
speed and accuracy largely ignoring the needs of their peers in the scientific community. A conference 
such as "RETCOMP'2013" gives us the opportunity to break down such barriers by bringing together 
academicians and industry think-tank. The objectives RETCOMP'2013 intend to meet demand finesse, 
novelty and power in devising new algorithms/methods and/or discuss existing ones. 

PES Institute of Technology, Bangalore South Campus is young and thriving. The Department of 
Computer Science and Engineering is ambitious and willing to harness a platform of cooperation between 
the Engineering and Science communities in its endeavor to pilot such efforts on a greater scale in near 
and distant future. RETCOMP'2013 is the first step in that direction. 

I'm delighted that the department has been able to call upon a set of brilliant researchers and I'm 
grateful for the way they responded. I'm hopeful of a strong epistle emerging out of the meeting and look 
forward to meeting the invited speakers and participants. 

Happy Brain-storming! 

Warm wishes, 

SnehanshuSaha 

(Dr.Snehanshu Saha) 

Program Chair-RETCOMP'2013 
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Abstract. Graphs arising from real life applications are typically large, 
depending on the task at hand a simple yet effective strategy would be 
to partition the graph and process the resultant sub graphs individually. 
More generally suppose a problem with a given objective is decomposed 
into smaller instances, how does the combined solution from the sub in- 
stances compare with the optimal solution obtained from the original 
instance. In this work we investigate the application of such a strategy 
with respect to two NP-hard problems that arise in many applications, 
namely the Minimum Dominating Set and Maximum Coverage. Accord- 
ingly we define performance ratios and derive relationships between these 
and quantities that can be computed from a given graph and sub graphs 
obtained from partitioning it. We show that a "minimum vertex support 
cut" maximises a lower bound on a defined performance ratio, for the 
minimum dominating set and also a special case of the maximum cover- 
age. We empirically assess how some of the commonly used graph cuts 
such as the normalised cut fare with respect to the performance ratios 
defined above on both synthetic and real graphs. 

Keywords : Graph cut, Minimum Dominating Set, Maximum Cov- 
erage, Word of mouth marketing, Social Network Analysis, Greedy 
algorithm 

1 Introduction 

The information age has resulted in an explosion of data, thus the size of prob- 
lems in present day applications presents many challenges. Graphs are a very 
useful representation in many applications, typically graphs arising from real life 
applications are large. One possible way do deal with a large problem is to break 
it into smaller problems which can then be processed individually. A natural 
question is the following. Consider an instance of a problem where the aim is to 
minimize/maximize a given objective. Suppose the problem is decomposed into 
smaller sub instances and the optimal solutions from each of these sub instances 
are combined, how much worse is the combined solution from the sub instances 
as compared to the optimal solution obtained from the original instance. 

In this paper we consider the above question with respect to two graph related 
NP-hard problems. The Minimum Dominating Set (MDS) (and the set cover 
problem of which the MDS may be regarded as a special case) and Maximum 
Coverage problems arise in many applications, such as choosing an initial set of 
target nodes in a word of mouth marketing scenario for instance. These may be 
formally stated as follows. 

Minimum dominating set (MDS) : Given a graph G = (V,E), find a set 
V CLV of smallest cardinality such that for all u G V — V there is a v <E V for 
which (u, v) G E . 

The Minimum dominating set can be treated as an instance of the well known 
Set Cover Problem [2]. 

Maximum Coverage (MC) : Given a universe U of n elements and a family 
F of subsets ofU, and an integer k <| F \, find the k-element subset C G F (i.e. 
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\ C \— k) that maximises UsecS ■ i.e. choose a subset of the family F such that 
the number of elements covered is maximised. 

Although both the set cover problem and Maximum Coverage problem are NP- 
hard, a polynomial time greedy approximation algorithm may be employed; in 
each step of the greedy algorithm, the set S covering the largest number of 
remaining uncovered elements is picked. The only difference in the application 
in case of Maximum Coverage, is that the algorithm terminates after k steps. 

While there exists a significant amount of literature related to the dominat- 
ing set and maximum coverage problems, the main contribution of this paper is 
examining the above problems in a problem decomposition context. Given that 
many real world networks consist of a large number of nodes, it may often be 
infcasiblc to work on the entire graph. A practical strategy in such a situation 
would be to partition the graph into smaller subgraphs which could be processed 
individually. In this work we focus specifically on the following question given a 
graph and subgraphs obtained by partitioning it, how close is the combined solu- 
tion from the individual subgraphs to the optimal solution that could be obtained 
from the original graph (with respect to a particular objective). 

The outline of the rest of the paper is as follows. We introduce some notation 
in section 1.1. In section 2 we define performance ratios for the above problems 
in terms of the ratio of the combined optimal solution from the subgraphs to 
that obtained from the full graph, we also define corresponding empirical perfor- 
mance ratios which can be computed directly from a given graph and subgraphs 
obtained by partitioning it. In section 2.1, we state a few relationships between 
these corresponding quantities focusing mostly on the case of two partitions. We 
report empirical results on real and synthetic graphs, partitioning the graphs 
using the popular min and normalised cuts respectively 

1.1 Notation 

We introduce some notation which will be used in the rest of the paper. We 
consider a graph G = (V, E) with an incidence matrix A for the rest of the 
paper. Almost all the discussion in this paper does not require a distinction 
between weighted and unweighted graphs. Thus G = (V, E) could be considered 
as a binary coverage graph (V is the set of vertices and E the set of edges 
and node i is said to "cover" node j if G(i,j) = 1). In practice, G could be 
obtained from the original network, by applying criteria most suitable to the 
particular application. For instance the coverage matrix could be the binary 
matrix obtained by applying a threshold on the edge weights of the original 
network. Another possibility is that a node i could be deemed to cover node j if 
there is a path of length h or less from node i to node j. 

If the graph is used for modeling dynamics of interaction between individual 
entities such as propagation of viruses (whether biological or computer viruses) 
through a social network, word of mouth marketing etc. and the edge weights 
represent probabilities of activation, then retaining only edges with weight above 
a threshold (say Pthresh) amounts to enforcing that any currently inactive node 
has a probability of activation of at least pthresh if any of its neighbours are 
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active. In this case G(i,j) = 1 implies that i activates j with a probability 

^_ Pthresh- 

— Define cover(G,S) = U xe s(N(x) U x) where S C V and N(x) denotes the 
neighbours of x. 

— Let C*(G,k) denote the maximum number of nodes in graph G covered 
by selecting k covering nodes i.e. C*(G,k) — maxg | cover(G,S) | where 
\S\=k. 

— Let Sjf denote the set of k nodes (Sjf C V) chosen according to the greedy 
algorithm. 

For a graph G = (V,E) partitioned into two subgraphs G\ = (V\,E\) and 
G 2 = (V 2 ,E 2 ),lct 

— the maximum node degrees for the graphs G\,G 2 and G be d\,d 2 and d 
respectively. Let 51,32 and g be the sizes of the greedy set covers (i.e. covers 
obtained by the greedy approach) of Gi,G 2 and G respectively. Let c* =| 
S^ |, c 2 =| S 2 I and c* —\ S* \ he the sizes of the minimum dominating sets 
of G\ , G 2 and G respectively 

— Define the vertex support of the cut as the total number of nodes in cither 
partition connected to nodes in the other partition in the original graph, i.e. 
if CS = {s,t} € E,\ s E Vi,t E V 2 denotes the cut set of the partition, and 
P = Us , Q = Lit are the sets of nodes in G\ and G 2 respectively connected 
to nodes in the other partition, then 

vSupCut(V u V 2 ) =| P U Q |=| P I + I Q I (1) 

— Define the minimum vertex support cut of a graph G = (V, E) as 

minVSupCut(G) = miny^ vSupCut(V u V 2 ) Vi + </>, V 2 ^ 0, V 1 n V 2 = V 



2 The Minimum Dominating Set and Maximum 
Coverage in a problem decomposition context 

We define performance ratios for the dominating set and maximum coverage 
problems as follows. Define x^ opUma ^ as the ratio of the solution obtained by 
combining the optimal solutions from the individual subgraphs to the optimal 
solution obtained from the original graph and let 

(optimal) _ rn l n (X^ 0pti ' mal ^ 1/X (°P timal )\ (2) 

Since computing the optimal solutions in the case of minimum dominating set 
and maximum coverage is NP-hard, x^ optlmal ^ and p(°P tlmal ) cannot be com- 
puted directly for large graphs. Thus we also define an "empirical" performance 
ratio p(9 reed v) as the ratio of corresponding empirical quantities computed us- 
ing the greedy algorithm outlined in Cormen et al. [2]. Let G = (V,E) be the 
graph partitioned into G\ = {V\,E\), . . . , G p = (V p , E p ) {V\ U V 2 . . . V p = V and 
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ViHVj = <j>,i ^ j). For the minimum dominating set and the maximum coverage 
the corresponding expressions are as below, these are discussed in a little more 
detail in the appendix. 

(optimal) -. / y (optimal) I *^ I /q\ 

Pmds -t/^MDs - | s , | + | g* | . . . | S ; I [) 

f .. n , .. n I coverfG, SI U . . . St ) \ 

{optimal) -^(oprzmeu) ' \ ' «i re p / I • .% 

%c = MC = \ cover (G,S*)\ [) 

For p = 2 partitions, the optimal and empirical performance ratios for the min- 
imum dominating set are: 

(optimal) _ c , (greedy) _ 9 ( r\ 

Pmds ~ ~I T „* ana Pmds - " T " l°J 



where the symbols are according to notation in section 1.1. Analogous to an 

(greed'. 
°MDS 



empirical performance ratio for the Minimum Dominating Set (Pmds ) we 



define one for the maximum coverage problem as : 

greedy) I cover (G 1 , sff ) | + | cover (G 2 , S% ) 
cover (G,S { k 9) ) \ 



(greedy) _ I V^JJ "fa ; I ' I v ^ z , ^ fca ; , 



2.1 Relating performance ratios to empirically computable 
quantities 

In this section we state propositions relating the performance ratios to quan- 
tities that can be empirically computed for a given graph (and its subgraphs), 
the vertex support of a cut is one such quantity that appears in two of these 
propositions. 

Minimum Dominating Set (MDS) While computing the minimum domi- 
nating set is NP-hard in general, bounds may be derived for the performance 
ratio in terms of quantities that can be computed empirically for a given graph. 
We state the following propositions, the proofs are given in the appendix. 

Proposition 1. Let G = (V,E) be partitioned into subgraphs G\ = (Vi,Ei) 
and Gi = (V2, E%) Then 

-'- - (optimal) 



vSupCut(V 1 ,V 2 )H(d) 

g 



- Pmds - l 



Proposition 2. 



(greedy) 
PMJg_ < p (opUmal) < ^^ greedy) R (rf)) (?) 

where P^ds V = — + — an< ^ Pmds 111 an( ^ H(d) are as explained earlier. It can 

also be shown that a tighter upper bound on /Jm/iT" * s ^1 ? ^2 — ■ 

757317 + H{d 2 ) 
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Maximum Coverage Given graph G and a budget k, we now express lower 
bounds on the performance ratio p^Q ma (see (13)) as given by propositions 
3 and 4. As before let G = (V,E) be partitioned into G\ = {V\,Ei) and G2 = 
(V 2 , E 2 ). Let S^ be the optimal k-node cover for G. Note that for the max cover 
problem there is an additional criteria, namely the quota of nodes allocated to 
G\ and G 2 , let the number of nodes be k\ and k 2 = k — k\ respectively. 

Let S$ (S { k f C Fi) be the set of fci nodes in Gi and SJfJ be the set of 
k 2 = k — k\ nodes in G 2 both chosen according to the greedy algorithm. Let 
S 1 = S'l n Vi and S 2 = S% fl V 2 be the subsets of SI present amongst V\ and V 2 
respectively. We consider the special case where ki ~\ S 1 \ and ki =| S 2 | i.e. 
the "budget allocation" to the subgraphs happens to be exactly right by chance. 
(Note that in practice we do not know | S l | and | S 2 |). Then the following 
relationships given by propositions 3 and 4 hold. 



Proposition 3. 



Proposition 4. 



vSupCut{Vi,V 2 ) (optimal) (Q , 

TT^i^V - Pmc [ ' 

cover(G,b k ) 



P^S edv) ^-^M<P { Mc mal) (9) 



where p^ e is as defined in 6. 

Summary and implications of Propositions 1-4 Proposition 2 and 4 

. t -1 * , . n r . . c (optimal) t (optimal) 

express the bounds on optimal pcrtormancc ratios tor p MDS and p MC 
in terms of the sizes of greedy set covers of the subgraphs and the full graph. 
Propositions 1 and 3 express the lower bound on the performance ratios for 
the minimum dominating set and maximum coverage problems respectively, in 
terms of the vertex support of the cut. The analysis of the maximum coverage is 
more difficult since it depends on how the total quota of nodes (" the budget" ) is 
divided amongst the subgraphs. Proposition 3 holds only in the special case when 
the actual allocation of nodes to each subgraph is same as the optimal allocation 
(by chance). Note that propositions 1 and 3 imply that minimising vSupCut i.e. 
minVSupCut(G) improves the lower bound on the optimal performance ratios 
(although this does not necessarily guarantee an optimal partitioning). 

It can be easily shown that minV SupCut(G) is analogous to the well known 
min cut (we omit this due to space constraints) . Although it may be possible to 
achieve high performance ratios with the min cut, a common problem is that it 
tends to favour unbalanced partitions, while it is often desirable to have balanced 
partitions. We subsequently report results obtained from partitioning the graphs 
using the min cut, ratio cut and normalised cut; the latter two tend to produce 
more balanced partitions. 

3 Experiments 

We illustrate a few representative results on synthetic and real datasets, the 
graphs varied in size with the number of nodes ranging from a few hundred to a 
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few thousand. For the real data we used co-citation graphs from bibliography 
of fifteen years of Case Based Reasoning (CBR) conferences 1 For the 

synthetic data we generated power law graphs, they are described briefly below 
and in detail in the appendix. 

The basic statistics of the graphs used in our experiments are summarised in 
table 1. 

Powerlaw graphs In order to construct synthetic graphs that approximate 
real world power-law graphs, random graphs with a given fixed input degree 
sequence (di > d%, . . . , d„) were generated and the degree sequence was chosen 

to follow an exact power law as given by : dj~ — round I (— J d m i n J where 

dmin = minimum node degree 

We generated two sets of powerlaw graphs for our experiments, the parameter 
values used were (a = 5,d m i n — 8) and (a — 10 ,d m m = 8). For each of these 
sets, power law graphs consisting of different numbers of nodes (n = 1000, 2000, 
5000 and 10000) were generated. For each value of n (number of nodes) 50 
powerlaw graphs were generated and the experiments were repeated on each of 
the 50 graphs. The min cut, the ratio cut and normalised cuts are commonly 
used graph cuts, also note that minSupCut and NVSupCut are closely related 
to the min cut and ratio cut respectively as described in the appendix. While the 
min cut achieves high performance ratios as defined in (5) and (6), a common 
problem is that it often produces highly unbalanced partitions, this is evident 
in table 2(a) for instance. The ratio and normalised cuts are preferred in many 
applications since they produce balanced partitions. 

Also we observed in our experiments (see the corresponding rows in table 
2(b,c) for instance) that the ratio cut may sometimes produce low empirical 
performance ratios, on the other hand the normalised cut consistently yielded 
high performance ratios including in those cases where the ratio cut performed 
poorly. For these reasons we mostly report results corresponding to the nor- 
malised cut. 

We used the software GMINUCUT which implements the algorithm pro- 
posed by Hao and Orlin [5] to compute the min cut. While computing the ex- 
act optimal ratio and normalised cuts are NP-hard, relaxations of these cuts 
can be formulated as eigenvalue problems. For the normalised cut we used the 
software Graclus which implements the multi- level approach proposed in [3], 
one main advantage of this software is that it can compute the normalised cut 
without a computationally intensive eigenvalue decomposition. We describe ex- 
perimental results pertaining to computation of empirical performance ratios 
from subgraphs obtained by partitioning the original graph. The results shown 
in tables below correspond to p = 2 partitions, the empirical performance ra- 
tios {pmds an d Pmc ) are re P or ted in the last columns of the tables. For 
the real graphs the empirical performance ratios were computed using the ac- 
tual empirical quantities for the corresponding graphs, for the power law graphs 



1 publicly available for research purposes, can be downloaded from 
http://mlg.ucd.ie/cbr 
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# nodes 


a = 5, dmin = 8 


a — 10, dmin = 8 


Average 
# edges 


Average 
highest 

degree 


Average 
# edges 


Average 

highest 

degree 


1000 


4978 


32 


4417 


16 


2000 


9965 


37 


8837 


17 


5000 


24922 


44 


22095 


19 


10000 


44856 


50 


44195 


20 



(a) 



Graph 
description 


# nodes 


# edges 


CBR 

Co-citation (Small) 


247 


8055 


CBR 

Co-citation (Full) 


657 


24682 



(b) 



Table 1. Basic statistics of simulated power law graphs and real graphs (the largest 
connected component in each case) used in our experiments, (a) Power law graphs 
generated according to (40) Columns 2-3 : d min — 8, a — 5 Columns 4-5 : d min = 8, 
q = 10. (b) Real graphs derived from bibliographic datasets 



the values averaged over 50 runs were used (for example for the MDS problem 

— for power law 



(greedy) 

Pmds 



— g_ 



gi+92 



for the real graphs and Pmds 



_ 2_ 



9 { r a) +& 



graphs) . For the maximum coverage problem, the number of covering nodes al- 
located to each subgraph was proportional to the size of the subgraph (number 
of nodes in the subgraph) . Assume G — (V, E) is partitioned into subgraphs 
d = (Vi,£i) and G 2 = (V 2 ,E 2 ) {V 1 \JV 2 = V). Let k be the number of covering 
nodes for the full graph. The number of covering nodes allocated to subgraphs 
G\ and G 2 were k\ = round(k \ V\ \ / \ V |), k 2 = round(k \ V 2 \ / \ V \ where 
| V | , | Vi | , | V 2 | arc the number of vertices in graph G and subgraphs G\ and 
G 2 respectively. 

CBR graphs Table 2 shows the results corresponding to CBR graphs. Table 
2(a) shows number of nodes in the p = 2 subgraphs obtained by partitioning the 
graphs using the min, ratio and normalised cuts respectively, the min cut often 
produces unbalanced partitions as evident from the table. Tables 2(b) and 2(c) 
show the results pertaining to the CBR graphs corresponding to Pmds an( ^ 
Pmc? respectively. While the min cut yields high performance ratios, this is 
due to highly unbalanced partitions, (for example in each of the CBR graphs one 
of the partitions consists of a single node) . 

Powerlaw graphs Tables 3 and 4 show the results pertaining to p M r ^ s and 
Pmc? f° r g ra P ns consisting of 1000 and 2000 nodes using the ratio and nor- 
malised cuts respectively . For each number of nodes (say 1000), the results 
averaged over 50 runs are shown. The numbers shown in the columns except the 
first and the last are the mean (standard deviation) of the respective quantities. 
The normalised cut provides good emprical performance ratios in most cases. 
These are shown in the last columns of the respective tables. 

We observe from these tables, that for the power law gaphs the performance 
ratios obtained using the ratio cut were very similar to those obtained using nor- 
malised cut; however for the CBR graphs the performance ratios corresponding 
to the ratio cut were inferior to those corresponding to the normalised cut. In 
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Graph 


Cut 


m 


n 2 


CBR 

Cocitation 

(Small) 


Min 


246 


1 


Ratio 


123 


124 


Normalised 


102 


145 


CBR 

Cocitation 
(Full) 


Min 


656 


1 


Ratio 


328 


329 


Normalised 


288 


369 



Graph 


9 


Cut 


ffi 


92 


(greedy) 

Pmds 


CBR 

Cocitation 

(Small) 


11 


Min 


11 


1 


0.9167 


Ratio 


19 


2 


0.5238 


Normalised 


7 


6 


0.84615 


CBR 

Cocitation 

(Full) 


27 


Min 


27 


1 


0.9643 


Ratio 


5 


66 


0.3802 


Normalised 


13 


20 


0.8181 



(a) 




(b) 








Graph 


Full graph 


Cut 


Sub graph 1 


Sub graph 2 


(greedy) 

Pmc 




(k, 

\C(G,si s) )\) 




(*i, 

\C(G U S%>)\) 


(to, 

,|C7(G 2 ,^)|) 




CBR 

Cocitation 
Small 


(10,246) 


Min 


(10,246) 


(0,0) 


1 


Ratio 


(5,41) 


(5,115) 


0.6341 


Normalised 


(5,125) 


(5,117) 


0.9837 


CBR 

Cocitation 
Full 


(10,619) 


Min 


(10,619) 


(0,0) 


1 


Ratio 


(5,328) 


(5,174) 


0.8109 


Normalised 


(6,360) 


(4,240) 


0.969 



(c) 
Table 2. Results for CBR graphs, partitioned using min cut, ratio cut and nor- 
malised cut respectively, (a) Sizes of subgraphs obtained from partitioning (b) 



(greedy) _ 
l J MDS ~ 



91+92 



/ \ \gre 

(c) Pmc 



(greedy) 



\C(G 1 ,SJ g i ) )\ + \C(G 2 ,si g 2 ) )\ 
|C(G,S< 9) ) 



almost all our results (may results omitted for the sake of conciseness) we found 
that the normalised cut yielded good performance ratios. 

4 Conclusions and future work 

In this work we examine the minimum dominating set and maximum coverage 
problems, in a problem decomposition context. We posed the question that given 
a graph and subgraphs obtained by partitioning it, how close is the combined 
solution from the subgraphs to that which can be obtained from the original 
graph. Towards this end we define optimal and empirical performance ratios as 
benchmarks for assessing the quality of a given partitioning. We address the 
above question both analytically and experimentally and provide a few initial 
insights. For the case of two partitions we show that a minimum vertex support 
cut maximises a lower bound on the corresponding empirical performance ratios, 
for the minimum dominating set and also a special case of the maximum cov- 
erage. Future work includes obtaining tighter bounds on "optimal performance 
ratios" . Another direction is investigating different graph partitioning schemes. 
In our experiments the normalized cut consistently provided a high empirical 
performance ratio, we suggest that it may also be a "good partitioning" with 
respect to the optimal performance ratio, deriving analytical results is another 
interesting question for future investigation. 



Dept of CSE, PESIT, Bangalore South Campus 



Proceedings of RETCOMP - 2013, Volume 1 



Graph 
Parameters 


Full graph 


Sub graph 1 


Sub graph 2 






# nodes 


Mean (std) 
9 


Mean (std) 
9i 


Mean (std) 

92 


(greedy) 
l J MDS ~ 




9 (-..) +fl (.««) 




a — 5, 

Ornin — o 


1000 


154.92 (2.75) 


101.22 (2.89) 


101.98 (2.85) 


0.7627 




2000 


310.68 (3.80) 


203.45 (4.24) 


201.70 (4.06) 


0.7663 




a= 10, 

Ornin — O 


1000 


169.92 (2.11) 


107.84 (2.66) 


108.5 (2.93) 


0.7856 




2000 


339.48 (2.89) 


215.72 (3.26) 


215.8 (3.74) 


0.7868 




Graph 
Parameters 


Full graph 


Sub graph 1 


Sub graph 2 


(greedy) 
l J MC ~ 


# nodes 


k 


Mean (std) 
\C(G,si 3) )\ 


Mean (std) 
\C{G U S^)\ 


Mean (std) 
\C(G 2 ,Sil ) )\ 


a = 5, 

Ornin — o 


1000 


10 


179.36 (5.43) 


68.92 (5.66) 


68.04 (5.52) 


0.7641 


100 


899.84 (5.22) 


399.30 (5.78) 


398.58 (5.11) 


0.8867 


2000 


10 


205.60 (6.14) 


75.03 (5.84) 


76.30 (6.43) 


0.7362 


100 


1202.25 (7.40) 


505.30 (7.56) 


506.40 (8.21) 


0.8415 


a = 10, 

Ornin — O 


1000 


10 


133.88 (2.30) 


55.85 (2.26) 


54.73 (1.71) 


0.8261 


100 


856.15 (3.39) 


383.00 (4.44) 


381.43 (3.27) 


0.8929 


2000 


10 


141.62 (2.43) 


57.92 (2.36) 


58.75 (2.16) 


0.8240 


100 


1065.65 (5.88) 


463.82 (4.78) 


466.02 (5.19) 


0.8726 



(a) 



(b) 



Table 3. Empirical performance ratios for power law graphs partitioned using the 
ratio cut. Table (a) : {Pmds ) 



Table (b) 



(greedy) 

Pmc 
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Graph 


Full graph 


Sub graph 1 


Sub graph 2 




parameters 


# nodes 


Mean (std) 
9 


Mean (std) 
9i 


Mean (std) 

92 


(greedy) 
l J MDS ~ 


„(-«», +B (.«.) 


a — 5 


1000 


154.92 (2.75) 


99.52 (7.33) 


103.50 (6.88) 


0.7633 


2000 


310.70 (3.80) 


200.48 (15.00) 


208.92 (13.69) 


0.7591 


a = 10 


1000 


171.92 (2.18) 


106.56 (7.06) 


105.52 (6.27) 


0.811 


2000 


332.12 (3.55) 


217.720 (8.45) 


215.68 (8.45) 


0.766 


Graph 
parameters 


Full graph 


Sub graph 1 


Sub graph 2 


(greedy) 

Pmc 


# nodes 


k 


Mean (std) 
\C(G,si 9) )\ 


Mean (std) 
\C(G 1 ,S^ ) )\ 


Mean (std) 
\C(G 2 ,S^)\ 


a = 5 


1000 


10 


179.36 


74.06 


65.16 


0.7763 




100 


899.84 


391.94 


382.06 


0.8601 


2000 


10 


205.42 


82.3 


73.46 


0.7588 




100 


1202.48 


495.48 


481.9 


0.8128 


a = 10 


1000 


10 


133.54 


54.8 


54.52 


0.8188 


100 


855.8 


374.82 


366.58 


0.8663 


2000 


10 


141.64 


58.98 


58.96 


0.8329 


100 


1065.22 


449.02 


446.36 


0.84059 



(a) 



(b) 



Table 4. Empirical performance ratios for power law graphs partitioned using the 
normalised cut. Table (a) : (p^^g ) Table (b) : p^^T 
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Appendix 

Denning performance ratios for the dominating set and 
maximum coverage problems 

Define x^ optlmal ^ as the ratio of the solution obtained by combining the optimal 
solutions from the individual subgraphs to the optimal solution obtained from 
the original graph and let 

( optimal) _ m i n <X (°P timal ) l/^(optimal)\ qq\ 

Since computing the optimal solutions in the case of minimum dominating set 
and maximum coverage is NP-hard, x^ optlmal ^ and p(°P tlmal ) cannot be com- 
puted directly. We also define an "empirical" performance ratio p(s reed v) as th e 
ratio of corresponding empirical quantities computed using the greedy algorithm 
outlined in Cormcn et al. [2]. 

Let G = (V,E) be the graph partitioned into G\ = (Vi, E\ ),..., G p = 
(V p , E p ) (Vi U V 2 . . . V p = V and Vi DVj = </>,i ^ j). The expressions for the 
Minimum Dominating Set and Maximum Coverage partitions are as follows: 

1. Minimum Dominating Set Let S*, S%, ..., S* and S* be the optimal 
(smallest) dominating sets of G\ and G2, ■ ■ ., G p and G respectively. Define 

X MD™ al) = lS1U f s :: US;l Note that since Sf, S%, S* are disjoint sets and 



^-(optimal) ^ -. 
A M£>S — i ' 



s* 



(optimal) * > *r(optimal) I " I /-i -i \ 

Pmds =i/*mds - 1 s* 1 + j s* 1 ... 1 S ; I [ ' 

For p = 2 partitions, the optimal and empirical performance ratios are: 

(optimal) _ c , (greedy) _ 9 ,~ „x 

Pmds - „* TTi ancl Pmds ~ ~T~TZ ( iZ ) 

c i + c 2 3i + .92 

where, c| =| SJ |, c\ =| S*| | and c* =\ S* \ are the sizes of the minimum 
dominating sets of G\ , G 2 and G respectively and g\ , g 2 and g are sizes of 
dominating sets obtained by the greedy algorithm Note that while Pmds 111 
is always < 1, p M T pg could exceed 1. 

Maximum coverage For the maximum coverage, the given inputs are the 
graph G and a budget k. There is an additional criterion, namely the alloca- 
tion of budgets i.e. the number of covering nodes allocated to G\ , G 2 , . . . , G p 
let these be fci, k 2 , ■ ■ ■ , k p respectively (Y^ h — k). Let S%. C Vi, . . . , S% C 
V p be the sets of nodes which form the optimal k\ , . . . , fc p -element optimal 
covers in G\, . . . , G p respectively. Let S£ be the optimal fc-element cover for 
G. 
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We define the objective function as, X^ > c ima = \ CO ver(G s*)\ kp ' ^ a - e corre ~ 

sponding performance ratio is : 

i *■ i\ i *■ r\ \cover(G,ST U . . . St ) I 

{optimal) y{optimal) I ^ 7 Ki k p / I /-. q\ 

Pmc -*mc - \ cover (G,S* k )\ [V 

Note that p^c/™" (= Xj^-, ma ) < 1, since S k is the fc-element set of nodes 
that covers the largest number of nodes in G. Note that p^ P c ima depends on 
not just the graph but on k and k±, . . . , k p , strictly speaking it must be denoted 

by Pmc" 111 (*-*> k,k\, . . . , k p ); however for simplicity of notation we drop fcs. 



Proof of propositions outlined in the paper 

We prove the propositions stated in the paper. The proofs use the following 
results in the literature, we briefly outline them here . 

Minimum Dominating Set : Let c* be the size of an optimal (the smallest) 
dominating set and g the size of the dominating set obtained using the greedy 
algorithm. It can be proved that the greedy set cover algorithm returns a set 
cover whose size is at most H(d max ) times the size of an optimal (the smallest) 
set cover where d max is the cardinality of the largest set and H{d) is the dth 
harmonic number. The reader is referred to [2] for a description and related 
analysis. This result in [2] expressing the size of the greedy set cover in terms of 
the optimal set cover can be written as c* > jS-rr where d is the highest degree 

and H(d) = £ti l/». 

Maximum Coverage : Ncmhauscr et at [7] show that the greedy algorithm 
provides a value within (1 — 1/e) of the optimal for sub modular functions. This 
result can be expressed as : 



| cover (G,S ( k 9) ) |> (1 - 1/e) | cover(G,S* k ) \ (14) 

where cover(., .), Sjf and S k are described in section 1.1. 
Proof of proposition 1: 

Proof. To prove the above proposition we derive bounds for x^ optlmal " > = Cl f 2 

(note that for the MDS problem p^pT^ = 1 1 X {opUmal) ■ It is obvious that the 
smallest possible value of x^ optlmal ^ is 1 since by definition c* is the optimal 
dominating set of the whole graph. Hence x < ■ opt^mal " , > 1. 

We derive the upper bound on x( optlmal ^ as below. 

— Let S* be the set of nodes of the minimum dominating set. 

— Let S and S be the nodes of S* present in G\ and Gi respectively i.e. 
S' =S*DV 1 ,S" =S*nV 2 and S' US" =S*).Letc =\ S' | and c" =| S" |. 
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— Let P and Q be the sets of all nodes in G\ and G 2 connected to nodes in the 
other partition. Let | P |= p and | Q |= q i.e. p nodes of G\ arc connected 
to q nodes of G 2 in the original graph G. 

All nodes in G\ except possibly the set of nodes P are covered by the set of 
nodes S since none of those are connected to any node in G2 ■ Thus S U P is a 
dominating set of G\. Hence c* <| S \ + \ P | i.e. 

4<c'+p (15) 

Similarly S U Q is a dominating set of G 2 and thus : 

c* < c" + q (16) 

From 15 and 16 and noting that c + c = c* , we get 

c\+ c* 2 < c + p + c + q 

= c*+p + q (17) 

Noting that p + q = vSupCut(Vi, V2) we have 

x (o P u ma i) = c i + C 4 < 1 | vSupCut(Vi,V 2 ) < 1 | vSupCut(V 1 ,V 2 )H(d) 
c* ~ c* g 



since c* > „ 9 , ,. . Taking the reciprocals of the relevant quantities we get the 

i n (optimal) 

bounds on p MDS 

Proof of proposition 2: 



Proof. Since c* is the size of the optimal (smallest) set cover, it is less than the 
sum of the optimal set covers of the sub graphs (true by definition). Hence, 

c* <c{ + c* 2 (19) 

Thus 1 is a trivial upper bound on Pm^™ 11 = c * c +c * ■ 

Let LBi, LB2, LB be lower bounds for c*, c\ and c* respectively. Similarly 
let UBi, UB2, UB be upper bounds. It is easily seen that 

LB c* UB 

< < (20 



UBi + UB 2 ~ cl+c* ~ LBi + LB 2 

We note that the sizes of the optimal set covers are at most the sizes of the 
corresponding greedy set covers. Hence we have, 

c\ < ffi, c* 2 < g 2 and c* < g (21) 

Also from the basic result on the size of the greedy set cover ([2]) we have 

" ! <4, j^ < c * and -f- < c* (22) 



H(d!) - *' H ^ -" 2<111U H(d) 
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From 21 and 22 we have, 



9i 



LBl= HWy UB ^=9iLB 2 = 1 ^, UB 2 =g 2 LB = j^- y UB = 03) 

Substituting for the lower and upper bounds from 23 in 20 we get the required 
result, i.e. 

c* 
< 





9 




91 


+ 


92 


ff(di) 

C* 


H(d 2 ) 



a 

and im < — (24) 

3i + 92 ~ c\ + c* 

The expressions for the lower and upper bounds can be simplified as follows. 

Note that d > d\ and d > d 2 and hence H(d) > H(d\) and H{d) > H(d 2 ) since 

H(.) is monotonic. Replacing H(d) in place of H(d\) and H(d 2 ) in 24 gives a 

simpler (and a more conservative) expression for the lower bound as gl "hT3T ' 

since 

9i 92 9i + 92 



H(d!) H{d 2 ) ~ H(d) 
Using the more conservative lower bound, (24) can be written as : 

{greedy) * 

Pmd \ < — < H(d)p ( ° r °i v) (25) 

H(d) ~ cl + c* ~ y >HMDS v ; 

where 

{greedy) 9 

Pmds ~ 



9i + .92 
and H (d) is as explained in the paper. Thus 

{greedy) 
PMDS ^ (optimal) ^ ■ ( , (greedy) tt ( ,\\ 

H r d \ <PmdS <min{l,p MDS H(d)) 

since 1 is a trivial upper bound. 

Propositions 3 and 4 

For the proofs of propositions 3 and 4, as before let G = (V, E) be partitioned 

into G\ = (V\,E\) and G 2 = (V 2 ,E 2 ). Let S* denote the optimal k node cover 

of G. Let S[ — S* n Vi and S 2 = S* D V 2 be the subsets of nodes of S* present 

in graph G\ and G 2 respectively. Let S* and S 2 be the optimal max covers for 

graphs G\ and G 2 . Note that for the max cover problem there is an additional 

criteria, namely the quota of nodes allocated to G\ and G 2 (i.e. | S^ \ and | S 2 |). 

Proof of proposition 3 

Consider the special case where | S{ \ — \ S x \ and | So |=| S 2 \. 

(Note that in practice we do not know | S x \ and | S 2 |)- 

Let S^ , S{ 9 and S 2 9 ' be the sets of nodes from graphs G\ and G 2 obtained by 

the greedy algorithm, where | S^ |= k, | S^ |=| S[ | and | 4 9) |=| ^ |. 
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It is easy to see that the upper bound for /0^/ ma is 1 since S* is the optimal 
k node cover for G. We derive the lower bound on p^ c ima as follows. 
It is easily seen that for any set of nodes X and Y the following holds. 



cover 
cover 



(d , X) = cover(Gi -P,X)U cover(P, X) 
•{G 2l Y) = cover(G 2 -Q,Y)U cover(Q, Y) 



(26) 



where - denotes set difference. 

As before let P be the set of all nodes in G\ which are connected to one or 
more nodes in G 2 in the original graph G and let Q comprise the set of nodes 
in G 2 connected to one or more nodes in G\. From 26 and noting that G\ — P 
and P are disjoint sets we have, 



cover{G u X) |=| cover{G x -P,X) 



cover{P, X) 



Using this we write down the following relationships as below. 



coverid -P,^) | 
cover(G 2 — P, S 2 ) | 
cover (Gi -P,S*) \ 
cover{G 2 -P,S 2 ) \ 



cover {G\, S x ) 
cover (G 2 ,S 2 ) 
cover (G±, S%) 
cover {G 2 , S 2 ) 



cover (P, S ± ) 
cover (P, S 2 ) 
cover{P 1 S{) 
cover (P, S 2 ) 



(27) 



We have, 



cover{G, S± U S 2 ) 



Similarly, 



cover {Gi — P, S ± ) \ + | cover(G 2 — Q, S 2 ) \ 
cover{P UQ,^U S 2 ) | (28) 



| cover(G, S{ U S 2 ) | = | covered - P, S*) \ + \ cover{G 2 - Q, S 2 ) 
+ | cover(PllQ,S*USZ) \ 

Substituing from 27 in 28 and 29 we get : 



(29) 



cover (G, S 1 U S 2 ) \ — \ cover (Gi, S-J | — | cover (P, S^) 
+ | cover(G 2 ,S 2 ) \ — | cover(Q,S 2 ) 
+ | cover (P U Q, s[ U S 2 ) \ 



(30) 



cover(G, S± U S 2 



cower(Gi,5J) | - | cover{P,Sl) 
cover(G 2l S 2 ) \ — \ cover{Q,S 2 ) 
cover(PuQ,SlUS 2 ) \ 



(31) 
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(30) - (31) gives : 

| cover{G, S[ U S 2 ) \ - \ cover(G, SI U S%) \ = ti + h + 1 3 - U (32) 
where, 

t\ = | cover(Gi,S 1 ) \ — \ cover (Gi,S%) | 

t 2 = | cover(G 2 , S 2 ) | - | cover(G 2 ,S 2 ) \ 

t 3 = | cover{P UQ,S[u S 2 ) | -(| cover(P, S[) \ + | cover(Q, S 2 ) |) 

ti = | cover(P U Q, S{ U S%) | -(| cover(P, Sjf ) \ + \ cover{Q, S%) |) 

— Since S^ constitutes the optimal k\ node cover for Gi we have 

ii =| cower(Gi,5' 1 ) | - | c<wer(Gi, SJ) |< 

— Similarly t 2 < 

— £4 > since S% U 5^ is guaranteed to cover at least as many nodes inFUQ 
as the sum of nodes covered individually by S^ in P and S 2 in Q. 

— £3 can at most be the total number of nodes in P U Q. Hence 

t 3 <PUQ=\P\ + \Q\ 

since P and Q are disjoint sets. 

Plugging in the separate maximum values for £1, £2, £3 and the minimum value 
for £4 in (32) we get 

cover (G, S[uS' 2 )\-\ cover (G, S* U S 2 ) \< p + q (33) 

where p =\ P \ and q = | Q |. Noting that S x U S 2 is indeed S* and dividing 
both sides of (33) by | cover (G, S*) | we have : 

_ \ cover (G, SI US%)\ p + g 

I cover (G, S*) \ ~ \ cover (G,S*) \ [ ' 

We make two minor observations. 

— The LHS of (34) is 1 - p { °^ mal) . 

— Also I cover (G,S*) \>\ cover(G,S^) |, hence 

p+q p+q , 



I cover{G,S*) \ ~ \ cover(G, S^)) 
Therefore we have, 



1 yvpi 

1 — Pmc 



(optimal) ^ V * Q ~> 



I cover{G,S*) | " | cover{G,S^)) \ 
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and hence 



P + Q < (optimal) 

cover(G,S(3))\ ~ Pmc 



thus proving the proposition. 
Proposition 4 

Let the number of nodes to be selected from the full graph be k. Let the quota of 
nodes allocated to subgraphs G\ and G 2 be k\ and k 2 respectively (k\ + k 2 = k). 
Let S^ , S{ 9 and S 2 be the sets of nodes from graphs G\ and G 2 obtained by 
the greedy algorithm, hence | S^ \= k, \ S{ 9 | = k\ and | S 2 \= k 2 - Define 
Pmc as : 

_ | cover{G\,S{) | + | cover (G 2 , 5*2 ) | 
PMC ~ | cover (G, S*) \ 

We note that 

| cover (d, SI) \ + \ cover{G 2l S* 2 ) \<\ cover(G, St U SJ) | (35) 

since S± U S 2 is guaranteed to cover at least as many elements in G as the sum 
of number of nodes covered separately in G\ and G 2 by S\ and S 2 respectively. 
Dividing both sides of (35) by | cover(G, S*) | we get puc < Pmc™^ • Hence 
a lower bound on pmc is also a lower bound (a more conservative one) on 

(optimal) 

Pmc 

Recall the expression below which is a consequence of the well known result 

on sub modular functions by Nemhauser et al. [7]: 



cover (G,S^) \<\ cover (G,S*) |< ' ^ \f^ 
Similarly for the two subgraphs G\ and G 2 we have, 



(36) 



i <n e(sh i^i in q*\ \s I cover{G 1 ,s[ 9) ) \ 

| cover(Gi,Sl y ') \<\ cower(Gi,5 1 ) |< , (37) 

i tn e(sh 1^1 m q*\ \s I cover ( G 2, s!f ] ) \ 

| cover {G 2 ,S y 2 J >) \<\ cover(G 2l S 2 ) \< — (38) 

1 — 1/e 

Similar to the proof of proposition 2, the above relations may be written as : 

LBi < | cover{G u St) \ < UB X 

LB 2 <\cover{G 2 ,S* 2 ) \ <UB 2 

LB < | cover(G,S*) \ < UB 

where LBi,LB 2 , LB are the lower bounds on cover(G\, S^), cover(G 2 ,S 2 ) and 
cover(G,S*) respectively. As in (20) we get : 

LB 1 + LB 2 | cover{G u St) \ + \ cover{G 2l S$) \ 



UB \cover{G,S*)\ 
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Plugging the lower bounds from (37), (38) and upper bound from (36) we get: 

(1 - \le)ptc dV) < PMC 

\cover<G 1 .S ( . 9) )\ + \cover<G 2 .S^ ) )\ a- ~ ^ (optimal) , ,, 

since pmc = J — ; * ' (gh — - J - 2i - Since p MC < p MC wc have the 

desired result, namely 

(1 - IMPmT^ < P^C™^ 

Description of graphs used in our experiments 

Synthetic graphs : Power law graphs 

It is often observed that for many real graphs the degree distribution follows 
a power law rather than a Poisson distribution [1]. Power law graphs are char- 
acterised by a significantly higher probability of nodes with large degrees as 
compared to a random graph. These nodes represent hubs which have a signif- 
icant impact on overall connectivity of a graph, since a connected set of hubs 
can provide short paths between a large proportion of nodes in the network. 
Other characteristics often associated with power law graphs are high cluster 
coefficient and a scale-free property i.e. an observation of certain properties at 
different scales. 

The degree sequence of a graph is the set of node degrees (di, d,2, ■ ■ ■ ,d n ) 
ordered such that di > . . . > d n . To construct synthetic graphs that approximate 
real world power-law graphs, random graphs with a given fixed input degree 
sequence were generated and the degree sequence was chosen to follow an exact 
power law as given by : For k — 1 , . . . , n, node degrees dk were chosen such that 

d k =round(ck a ) (39) 

The constant of proportionality c was set by specifying the minimum degree 
d n = d min , this yields 

dk = round (( — I d m i n I where d m i n = minimum node degree (40) 

Real-world graphs do not exhibit the perfect power-law characteristics of 
these synthetic graphs, but rather tend to exhibit a power-law behaviour in 
certain ranges of node degrees. Nevertheless, this is a useful approximation that 
allows us to examine algorithms on reasonably large-scale graphs which exhibit 
some of the characteristics shown by real- world graphs. 

Real graphs : Graphs from bibliographic data 

CBR dataset 

It is often argued that co-authorship networks capture many key features of social 
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networks in general(eg. [8]). Also co-citation analysis has become the dominant 
method for the empirical study of the structures of scientific communication [6] . 
We used graphs derived from bibliographic data for our experiments, these are 
described below. In most of our experiments the edge directions and weights were 
ignored and the largest weakly connected component was used. The Case Based 
Reasoning (CBR) co-citation dataset was collected from bibliography of fifteen 
years of CBR conferences, the aim was to examine the themes that have evolved 
in CBR research as revealed by the implicit and explicit relationships between 
the conference papers, the interested reader is referred to [4] for more details. 
The CBR cocitation dataset consists of a total of 672 papers from the CBR 
conference series, published by 828 individual authors. It is publicly available 
for research purposes 2 . We refer to this dataset as CBR Cocitation (Full). 
We also performed experiments on a subset of the CBR dataset this consisted of 
papers only within the CBR conferences, we refer to this as CBR Cocitation 
(Small). 



can be downloaded from http://mlg.ucd.ie/cbr 
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Abstract 

A localized radial basis function (RBF) meshless method is developed for solv- 
ing laminar natural convection problems in enclosures. The developed scheme is 
based on computing the weights for the derivatives by localizing the RBF inter- 
polation concept. This localization approach helps to control the instability due to 
ill-conditioning of the RBF interpolation matrix. The governing equations are con- 
sidered in the velocity-pressure formulation, where the decoupling of momentum 
and continuity equations are obtained using a fractional step algorithm. Numerical 
results are presented for the benchmark problems such as: (1) natural convection of 
air in a square cavity and (2) natural convection in an annulus between two isother- 
mal concentric square ducts. Comparisons with the existing benchmark solutions 
show that the developed local RBF gridfree scheme is stable and produces accurate 
results. 

Key words: natural convection, radial basis function, multiquadric, fractional- 
step, square enclosure 



1 Introduction 



Natural convective heat transfer has received a lot of attention in the litera- 
ture. Applications include nuclear reactor design, cooling of electronic equip- 
ment, aircraft cabin insulation and thermal storage systems. Numerical so- 
lution of the natural convection problems is challenging mainly due to the 
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nonlinearity and strong coupling that exists between the governing equations. 
Therefore development of faster and efficient numerical schemes is a never 
ending continuing process for the researchers in heat transfer. 

Kansa [1] has initiated the use of RBF interpolation in the global collo- 
cation methods for solving partial differential equations. Due to the global 
nature of the method, the resultant linear system was full and highly ill- 
conditioned. Various approaches, namely, domain decomposition, precondi- 
tioning and the use of compactly supported RBFs [2,3] have been experi- 
mented to circumvent these difficulties. Recently, an RBF-based local method 
has been proposed by Wright and Fornberg [4]. The localization approach 
given in their work has been successfully extended by Chandhini and Sanyasir- 
aju to steady convection-diffusion problems [5] and to unsteady incompressible 
Navier-Stokes equations [6]. 

In the present study, the local RBF scheme developed for unsteady incom- 
pressible viscous Navier-Stokes equations [6] has been extended to solve nat- 
ural convetion problems. The velocity-pressure formulation of the governing 
equations has been employed to model the physical process and a fractional 
step algorithm [7] has been used to decouple velocity and pressure terms. 

The rest of the paper is organized as follows: Section 2 gives a brief description 
of the local RBF based gridfree scheme. The extension of RBF scheme to 
natural convection equations in conjunction with fractional step algorithm 
have been presented in Section 3. Finally, Section 4 discusses the validation 
of the developed scheme using benchmark problems for natural convection in 
closed enclosures and compared with the existing results. 



2 RBF Based Local Scheme 



Let C be any linear differential operator and n be the total number of nodes 
in the discretized domain. Also assume that X{ be any point from the domain. 
Define Si = {x 1 , . . . , x n% } as a neighborhood of x t consisting of n^ (<C n) nodes. 
To approximate Cu(xj) over the nodes from Si, let it be represented as a linear 
combination of u at the points of Si, given by 

rii 
Cu{Xi) « ^CjU{Xj) (1) 

3=1 



Then, the computation of the weights Cj gives the required approximation to 
Cu(xi). To proceed further, also consider the Lagrange representation of an 
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interpolant s(x) of u(x) given by 

Hi 

s (x) = X^j(z)u(%) ( 2 ) 

where ipj(x) y s are the Lagrange functions which satisfy the cardinal conditions, 

ipj(x k ) = 5 jk , j,k = l,2,...,7ii. (3) 

Applying the operator L to the Lagrange representation of RBF interpolant 

(2) gives, 

Cu(xi) « £s(xi) = ^£Vj(^j)w(%) (4) 

3=1 

Comparing equations (1) & (4), c/s can be given as, 

Cj = Cil>j(xi), j = i,...,m (5) 

Therefore, to obtain each Lagrange function ipj in (2), an RBF ($(||.||)) based 
interpolation problem is developed. That is, each ipj is approximated as, 

rii I 

i/jj(x) « ^XjMlx-XklD + ^ljitPkix), j = l,2,...,m. (6) 

fc=l fc=l 

where 0(||.||) is some radial function, {Pj(x)} l j =1 is a basis for LT^ (space of all 
d— variate polynomials with degree < m) and / is the dimension of LT^. The 
weights Xjk and ^j k are obtained by imposing the cardinal conditions (3) and 
orthogonality conditions 

rii 

J2 X jPk(xj) = 0, k = 1, ...,l (7) 

3=1 

on the interpolant (6). From the interpolation problem (6)-(7), by making use 
of the symmetry of the interpolation matrix and some simple properties of 
the determinants, a general form for ipj(x) is obtained (refer [6]) and used in 
equation (5) which leads to, 

Spile 

' CB{xi) 

p t oj yp, 

where 3 i;j = 0(]|x; - Xj\\), i,j = 1, . . . ,n h p itj = Pj(xi), j = 1, ...,/& i = 
1, . . . , rij, /x, a dummy vector corresponding to the vector 7 in (6) and B{x) is 
given by 

B(x) = [<p(\\x - xi\\) (j)(\\x - x 2 \\) ... (p(\\x - x ni \\) I pi(x)p 2 (s) ...p;(x)] T 

(9) 
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Therefore, the weights Cj's in (1) are computed using (8) and (9). It is also clear 
from the development of the final linear system (8) that though it is dense, 
the size of (8) is only rii, which is very much smaller than the size (n) of the 
global RBF collocation system. This makes the system more stable for wide 
range of e. Further, only the right hand side of (8) depends on the operator £, 
for which the weights are to be computed. This optimizes the computation if 
weights have to be computed for many operators with the same distribution 
of nodes, as in the case of non-linear equations. 

Franke [8] , has made a comprehensive comparison of about seven groups con- 
sisting of about 30 interpolation methods on six different test functions and 
found that performance of MultiQuadric (MQ) is the most impressive and con- 
sistently performed better in terms of accuracy. Therefore, the multiquadric 
function, developed by Hardy [9] and defined by 



$(x) 



1 + e 2 \\x\ 



(10) 



where e > 0, has been used in all the computations presented in the subsequent 
sections. 



3 Implementation of RBF Local Scheme to Natural Convection 
Equations 



For two-dimensional problems, the natural convection fluid flow equations in 
non-dimensional form can be written as 



„ . du dv 
Continuity equation: — — \- — = 

ox oz 




(11) 


Momentum equations: 


du du du dP 

7^7 + U TT + V TT = ~^T + Pr 
ot ox oz ox 


d 2 u d 2 u 
dx 2 dz 2 


(12) 


dv dv dv dP 

77T + U TT + V TT = ~^T + Pr 
ot ox oz oz 


d 2 v d 2 v 
dx 2 dz 2 


+ Ra PrT (13) 


dT dT dT d 2 T 
Energy equation: + u +v = 

ot ox oz ox 1 


d 2 T 
dz 2 


(14) 



where (u,v), P and T are dimensionless velocity field, pressure and temper- 
ature, respectively. The necessary initial and boundary conditions depend on 
the chosen problem and will be discussed later. The non-dimensional param- 
eters Raleigh number Ra and Prandtl number Pr are defined as 



Ra 



gpAtH 3 



va 



v 

and Pr = — 

a 



(15) 
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with g acceleration due to gravity, /3 thermal expansion coefficient, H charac- 
teristic length, v kinematic viscosity and a thermal diffusivity of the fluid. 



3.1 Fractional Step Algorithm with Local RBF Scheme 



The procedure, described in this section, is based on the explicit Euler time dis- 
cretization. The development of the scheme based on other temporal schemes 
also can be generated, however, for simplicity Euler scheme has been chosen 
in the presentation. 

Discretization of the momentum equations (12) and (13) and energy equations 
by Euler method gives 



u 



(n+l) _ u (n) 



At 



du du dP „ 

u ~^ — v ^ — ~^r + Pr 

dx dz dx 



d 2 u d 2 u 
dx 2 dz 2 



(n) 



(16) 



V 



(n+l) _ v (n) 



At 



dv dv dP _ 

u ^ V 7T ~ T^~ + Pr 

ox oz oy 



d 2 v d 2 v 
dx 2 dz 2 



\(n) 

+ RaPrT\ (17) 



rp(n+l) _ ^p(n) 

At 



dT dT 

dx dz 



8 2 T 8 2 T 
dx 2 dz 2 



(n) 



dP s 



Rewriting (16) and (17) in the form 

u (n+l) =U * + Af l_\ and v (n+l) =V * + At l 

\ OX J \ OZ 

where u* and v* are given by 

f du du\ „ / d 2 u d 2 u 

ir = w (n) + At - [u— + v— + Pr — + — 

\ ox oz \ ox 2 oz 2 



(n) 



(n) 



V = V 



{n) + At 



' dv dv\ „ ( d 2 v d 2 v\ „ „ m 

u ir + v 7r + Pr brr + 77T + PaPrT 

, dx dz \ dx 2 dz 2 ) 



(n) 



(19) 



(20) 



(21) 



At each time step, the solution is expected to satisfy continuity equation. 
Hence (u,v)^ n+1 ^ must satisfy the continuity equation (11), which implies 



du dv 
dx dz 



(n+l) 



(22) 
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or 



d_ 

dx 



u* + At 



dx 



(n) 



+ 



d_ 

dz 



v* + At 



Equation (23) can be simplified to 



dP_\ 



(«)■ 



(23) 



d 2p(n) Q2p(n) j_ / Q u * Q y * 



dx 2 



dz 2 



At \ dx dz 



(24) 



Equation (24) is a Poisson's equation with non-zero source term ^r + ^7 (since 
u*,v* need not satisfy the continuity equation). Numerical implementation 
of the procedure described from (16) to (24), based on 'local' RBF gridfree 
scheme, can be summarized as follows. 

Assume that for each center Xj, there are nj nodes in the the supporting 
domain, say {x^,^, . . . ,x in }. Using the 'local' RBF scheme compute the 
weights Ci, c 2 , . . . , c ni and c[, c' 2 , . . . , c' n . for the linear operators C = —u^Jj^ — 

v ^~§z ano ^ ^ = ~§x* + ~§z?-> res P ec tively Then the discretized form of the 
equations for temperature and intermediate velocities are, 



u = u 



(n) + At 



(n) 



J2(Cj + Pr C 'j)% 



(25) 



v = v 



(n) + At 



J2(cj + Pr c'jXJ' + RaPrT {n) 



(26) 



rp(n+l) _ rp(n) _|_ ^ 



Y,{cj + fyTi 



1 3=1 



in) 



(27) 



Assume that the weights for ^ and -^ are d\, d 2 , ■ ■ ■ , d ni and d[, d' 2) ■ ■ ■ , d' n ., 
respectively. Using which the pressure equation in (24) can be discretized as 



V^ c '. p(") 



n; 



II, 



j=l j=l j=l 



(2S 



Equation (28) is a sparse linear system which can be solved to obtain P^ 
using any iterative method. In these computations the weights Cj,c'j,dj and 
d'j's are computed only once (where rii «n) and used for all time levels. 
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4 Results and Discussion 



The developed RBF scheme, described in the previous section, is validated 
by solving some benchmark problems from natural convective heat transfer. 
The effect of Rayleigh number on the fluid flow and heat transfer rates in 
square enclosures have been studied and those observations are discussed in 
this section. 



4-1 Natural convection in a square enclosure 



Consider the two-dimensional flow of a Boussinesq fluid of Prandtl number 
0.7 (air) in an upright square cavity described in non-dimensional terms by 
< x < 1, < z < 1 with buoyancy acting vertically upwards. The boundary 
conditions for the velocity components are u — v — on all the four bound- 
ing walls of the cavity due to their stationary nature. The non-dimensional 
temperature T is taken as -0.5 and 0.5 on the walls at x = and 1 while 
the walls at z = and 1 are treated as insulated as shown in the Figure 1. 
Though the developed scheme can work on any arbitrarily distributed centers, 
however, in the present computations for the sake of simplicity, a uniformly 
distributed 81 x 81 points have been used for all the computations. The time 
increment has been fixed as 0.0001. Simulations are carried out, for various 
values of Rayleigh numbers ranging from 10 3 to 10 6 , until steady state solution 
is reached. The results are compared qualitatively by observing the contour 
patterns of stream function, vorticity and temperature (isotherms). Quanti- 
tative comparisons are carried out by looking at the values of \ip m id\, \4>max\, 
u max (z), v max (x) and average Nusselt number Nu. 

The Nusselt number (Nu) has been computed using the local heat flux which 
represent the heat transfer across the cavity. The local heat flux in the hori- 
zontal direction at each point in the duct is given by 

dT 
Q(x,z)=uT- — (29) 

Then on any cross-section (parallel to z-axis), the heat flow can be given as 

Nu x = f Q(x, z)dz (30) 

Jo 

Average Nusselt number (Nu) is calculated by integrating Nu x over x, that 
is, 

Nu~= [ Nu x dx (31) 

Jo 
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Figures 2-4 illustrate the streamlines, vorticity contours and isotherms, re- 
spectively for various values of Ra. Salient features of the flow, namely absolute 
value of the ip at the mid point |?/Vmd|, maximum absolute value of the stream 
function |f/Wr|, the maximum value of the x component of the velocity and 
its vertical location u max (z), the maximum value of the z component of the 
velocity and its horizontal location v max (x), average Nusselt number Nu, heat 
flux at x — 0.5 Nu\/2 and heat flux at x = wall Nuq, have been compared 
quantitatively with the benchmark solutions [10] for different values of Ra. 
The corresponding values have been reported in Table 1. It is evident from 
the streamline patterns reported in Figure 2 that for Ra = 10 3 and 10 4 , max- 
imum absolute value of the stream function occurred at the midpoint of the 
cavity and hence not included in the table. However, it moves away substan- 
tially from the midpoint for Ra = 10 5 and 10 6 . It is also very clear from the 
reported figures that the developed scheme very accurately captures the anti 
symmetric nature of the flow with respect to x — 0.5. The average Nusselt 
number and the maximum value of the x component of the velocity u max are 
compared with [10] at various Ra and reported in Figure 5. The results agree 
with a maximum error less than 4%. 



4-2 Natural convection in an annulus between two concentric square ducts 



The schematic diagram of the problem is shown in Figure 6. Annulus be- 
tween the ducts is filled with a viscous, incompressible Newtonian fluid. In 
the present investigation, fluid with Prandtl number Pr = 0.7 has been con- 
sidered. Two different constant temperatures at the inner and outer walls, 
respectively are maintained with the inner wall temperature greater than that 
of the outer walls. 

Simulations in this problem are carried out for the dimension ratio L — j^- — 
1/5, which determines the width of the annulus. The local Nusselt numbers 
at both inner and outer boundaries are calculated using the equations 

*,„h - =a-*)/' «* = <^>/* (32 ) 

= (1 - L)/2 or x = (1 + L)/2 



, -f:, if z = or z = \ 
Nu ={ d ;' (33) 

-^, if x = or x = 1 

Using the values of Nui n and Nu , the average Nusselt numbers at inner and 
outer squares are computed using the formulae 
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Nu ir 



(l+L)/2 

iv u tn 
(l-L)/2 

(l+L)/2 

(l-L)/2 



Nu in dx 
Nui n dz 



(l+L)/2 

z=(l-L)/2 J0-- L )/ 2 

(l+L)/2 



-Z V Hi if y-> ijj tAj 



Nuindz 



x=(l-L)/2 J ( 1 - L )/ 2 



2=(l+L)/2 



:r=(l+L)/2. 



(34) 



A^M 



Nu dx 



+ 



Nu dx 



Nu dz 



+ 



z=0 



Nu dz 



X = lJ 

(35) 



Rayleigh number has been varied from 10 3 to 10 6 such that the flow undergoes 
a shift from conduction dominated to laminar free convective one. The com- 
putations have been carried out with a 41 x 41 uniform distribution, with time 
step 0.0001, for Ra upto 10 5 . However, for Ra = 10 6 , number of centers has 
been doubled and correspondingly time step has been reduced to 0.00001 to 
retain the stability. Figures 7 and 8 represent the streamlines and isotherms, 
respectively, obtained in the annular region for Ra from 10 3 to 10 6 . From the 
isotherms for different Ra, it is seen that for small values of Ra, temperature 
distribution is conductive in nature, while for Ra > 10 4 effect of natural con- 
vection is more dominant. The average Nusselt number at the walls have been 
compared with [11] and reported in Figure 9. 



5 Conclusions 



A local radial basis function (RBF) based gridfree scheme has been developed 
for solving laminar natural convection problems in enclosures. The developed 
scheme has been discussed elaborately for the governing equations for natu- 
ral convection in conjunction with fractional step algorithm. The benchmark 
problems like Natural convection in a square enclosure, Natural convection 
in an annulus between two concentric square ducts, have been solved and 
compared with the existing results. 



References 



[1] E.J. Kansa, "Multiquadrics - A scattered data approximation scheme with 
applications to computational fluid dynamics - II solutions to parabolic, 
hyperbolic and elliptic partial differential equations", Comput. Math. Appl., 
vol.19, pp.147-161, 1990. 



Dept of CSE, PESIT, Bangalore South Campus 



29 



Proceedings of RETCOMP - 2013, Volume 1 



[2] Z.M. Wong, Y.C. Hon, T.S. Li, S.L. Chug, E.J. Kansa, "Multizone 
decomposition of time dependent problems using the multiquadric scheme" , 
Comput. Math. Appl., vol.37 (8), pp. 23-43, 1999. 

[3] L. Ling, E.J. Kansa, "A least-squares preconditioner for radial basis functions 
collocation methods", Advances in Computational Mathematics 23 (1-2) (2005) 
31-54. 

[4] G.B. Wright and B. Fornberg, "Scattered node compact finite difference- type 
formulas generated from radial basis functions", J. Comput. Phys., vol.212, 
pp.99-123, 2006. 

[5] G. Chandhini and Y.V.S.S. Sanyasiraju. "Local RBF-FD solutions for steady 
convection-diffusion problems" , Int. J. Num. Meth. Eng., vol.72 (3), pp. 352- 
378, 2007. 

[6] Y.V.S.S. Sanyasiraju and G. Chandhini. "Local Radial Basis Function Based 
Gridfree Scheme For Unsteady Incompressible Viscous Flows", J. Comput. 
Phys., vol.227, 8922-8948, 2008. 

[7] J. Kim and P. Moin, "Application of a fractional-step method to incompressible 
Navier-Stokes equations", J. Comput. Phys., vol.59, pp. 308-323, 1985. 

[8] R. Franke, "Scattered data interpolation: Test of some methods", Math. Comp., 
vol.48, pp. 181-200, 1982. 

[9] R. L. Hardy, "Multiquadric equations of topography and other irregular 
surfaces", J. Geophys. Res., vol.76, pp. 1905-1915, 1971. 

[10] G. De Vahl Davis, "Natural convection of air in a square cavity: A bench mark 
numerical solution", Int. J. Num. Meth. Fluids, vol.3, pp. 249-264, 1983. 

[11] H. Asan, "Natural convection in an annulus between two isothermal concentric 
square ducts" Int. Comm. Heat Mass Transfer, vol.3, pp. 367-376, 2000. 



DeptofCSE, PESIT, Bangalore South Campus 30 



Proceedings of RETCOMP - 2013, Volume 1 



Table 1 

Comparison of main features of the solutions of rectangular duct with [10] 
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Fig. 1. Geometrical representation of the square duct 
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Fig. 2. Streamlines for (a) Ra = 10 3 , (b) Ra = 10 4 , (c) Ra = 10 5 , (d) Ra = 10 6 
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Fig. 3. Vorticity contours for (a) Ra = 10 3 , (b) Ra = 10 4 , (c) Ra = 10 5 , (d) 
Ra = 10 6 
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Fig. 4. Isotherms for (a) Ra = 10 3 , (b) Ra = 10 4 , (c) Ra = 10 5 , (d) Ra = 10 6 
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Fig. 5. Comparison of (a )Nusselt number and (b) maximum horizontal velocity (u) 
with Davis [10] 
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Fig. 6. Geometrical representation of the square annulus 
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Fig. 7. Streamlines for with (a) Ra = 10 3 , (b) Ra = 10 4 , (c) Ra = 10 5 , (d) Ra = 10 6 
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Fig. 8. Isotherms for (a) Ra = 10 3 , (b) Ra = 10 4 , (c) Ra = 10 5 , (d) Ra = 10 6 
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Fig. 9. Comparison of average Nusselt number for outer wall for the dimensions L 

= 1/5 
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Abstract— Image retrieval is an important topic in the field of pattern recognition and artificial intelligence. There 
are three categories of image retrieval methods: text-based, content-based and semantic-based. In Content based 
Image Retrieval (CBIR), images are indexed by their visual content, such as color, texture, shapes. A new image 
feature detector and descriptor, namely the micro-structure descriptor (MSD) is discussed to describe image features 
via micro-structures. The micro-structure is defined based on the edge orientation similarity, and the MSD is built 
based on the underlying colors in microstructures with similar edge orientation. Content-based image retrieval 
(CBIR) is the mainstay of image retrieval systems. To be more profitable relevance feedback techniques are 
incorporated into CBIR such that more precise results can be obtained by taking user's feedbacks into account. The 
semantic gap between low-level features and high-level concepts handled by the user is one of the main problems in 
image retrieval. On the other hand, the relevance feedback has been used on many CBIR systems such as an effective 
solution to reduce the semantic gap. The gap is reduced by using the Multitexton Histogram descriptor (MTH) . 

In this paper, we propose a novel approach to reduce semantic gap in CBIR using relevance feedback based on the 
MSD and MTH descriptors to achieve high efficiency and effectiveness of CBIR in coping with the large-scale image 
data. For that reason this paper proposes a method of relevance feedback based on the Microstructure descriptor 
(MSD) for efficient feature extraction of an image and Multitexton Histogram descriptor to represents the effective 
feature representations. By using this method, high quality of image retrieval on Relevance Feedback can be achieved 
in a small number of feedbacks. In terms of efficiency, iteration of feedback is reduced substantially by using the 
navigation patterns discovered from the user query log, which reduce the computational processing time. 

Keywords — CBIR, Relevance Feedback, Semantic Gap, Microstructure descriptor (MSD), Multitexton Histogram 
descriptor. 

I. INTRODUCTION 

With the steady growth of computer power, rapidly declining cost of storage and ever-increasing access to the 
Internet, digital acquisition of information has become increasingly popular in recent years. This trend has 
motivated research in image databases, which were nearly ignored by traditional computer systems due to the 
enormous amount of data necessary to represent images and the difficulty of automatically analyzing images. 
Currently, storage is less of an issue since huge storage capacity is available at low cost. In many areas of 
commerce, government, academia, and hospitals, large collections of digital images are being created. Many of 
these collections are the product of digitizing existing collections of analogue photographs, diagrams, drawings, 
paintings, and prints. The Content Based Image Retrieval System (CBIR) is a system, which retrieves the 
images from an image collection where the retrieval is based on a query, which is specified by content and not 
by index or address. The query image is an image in which a user is interested and wants to find similar images 
from the image collection. The primary goal of the CBIR system is to construct meaningful descriptions of 
physical attributes from images to facilitate efficient and effective retrieval. 

A way to retrieve this information is through the CBIR systems, although a large amount of research has been 
developed in this field, the performance in this system has not yet been successful due to the existence of 
semantic gap. Because of we use high-level concepts such as keywords or text descriptions to describe images 
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content and/or to measure their similarity. While that the computer-vision systems can automatically extracted 
low-level features from images such as color, shape, texture and spatial relationships. On the other hand there is 
also subjectivity of human perception of visual content of the images because of different persons or the same 
person under different circumstances, may perceive the same visual content differently during the information 
retrieval process [1], [2]. In that sense the CBIR systems try to reduce the semantic gap through relevance 
feedback based on the event activated human computer interaction model, which is a challenging task. The 
relevance feedback (RF), adapts the response of a system according to the relevant information fed back to it so 
that the adjusted response is a better approximation to the users information needs [3]. 

While there is much research effort addressing content-based image retrieval issues, the performance of 
content-based image retrieval methods are still limited, especially in the two aspects of retrieval accuracy and 
response time. The limited retrieval accuracy is because of the big gap between semantic concepts and low-level 
image features, which is the biggest problem in content -based image retrieval [4]. For a given query, the system 
first retrieves a list of ranked images according to predefined similarity metrics, which are often defined as the 
distance between feature vectors of images. In order to improve the retrieval accuracy of content -based image 
retrieval systems, research focus has been shifted from designing sophisticated low-level feature extraction 
algorithms to reducing the 'semantic gap' between the visual features and the richness of human semantics. 

The micro-structures are defined based on an edge orientation similarity, and the MSD [5] is built based on 
the underlying colors in microstructures with similar edge orientation. With microstructures serving as a bridge, 
the MSD extracts features by simulating human early visual processing and it effectively integrates color, 
texture and shape and color layout information as a whole for image retrieval. 

In each iteration of the RF method, the system returns some images as the result for a query image and then 
the user evaluates the results according to his/her demands and preferences, and tells the system which images 
are relevant and which are irrelevant. The MTH is a new feature representation and descriptor to retrieval 
contend image which integrates the advantages of co-occurrence matrix and histogram by representing the 
attribute of co-occurrence matrix using histogram. The generation process of the tree is based on the results of 
similarity between images using the MTH descriptor, the assignment of weights and the user interaction. 

The rest of this paper is organized as follows. The related work is described in Section II. Section III deals 
with the proposed work which describes the relevance feedback approach for CBIR system, and finally section 
IV concludes the paper. 

II. RELATED WORK 

Guang-Hai Lieu .al, [5] proposed that human visual attention is enhanced through a process of competing 
interactions among neurons, which selects a few elements of attention and suppresses irrelevant materials. In 
order to extract features via simulating visual processing procedures and effectively integrate color, texture, 
shape features and im age color layout information as a whole for image retrieval, a novel feature detector and 
descriptor, namely microstructures descriptors (MSD). The MSD has advantages of both statistical and 
structural texture description approaches. In addition, the MSD algorithm simulates human visual perception 
mechanism to some extent. Ja-Hwung et. al, [6] described that conventional approaches for image retrieval are 
based on the computation of the similarity between the user's query and images via a query by example (QBE) 
system. Despite the power of the search strategies, it is very difficult to optimize the retrieval quality of CBIR 
within only one query process. The hidden problem is that the extracted visual features are too diverse to capture 
the concept of the user's query. To solve such problems, in the QBE system, the users can pick up some 
preferred images to refine the image explorations iteratively. The feedback procedure, called Relevance 
Feedback (RF), repeats until the user is satisfied with the retrieval results. 

Marakakis et .al, [7]introduced new relevance feedback (RF) approach for content-based image retrieval 
(CBIR), which uses Gaussian mixture (GM) models as image representations. The GM of each image is 
obtained as an adaptation of a universal GM which models the probability distribution of the features of the 
image database. In each RF round, the positive and negative examples provided by the user until the current 
round are used to train a support vector machine (SVM) to distinguish between the relevant and irrelevant 
images according to the preferences of the user. Lu Hui et. A,l [8] introduces a relevance feedback system for 
CBIR with both short-term relevance feedback and long-term learning. In short-term relevance feedback, query 
reweighting algorithm, support vector machines (SVM), and genetic algorithm are adopted. In long-term 
learning, the expanded-judging model with index table is used for analyzing the historical log data. Through 
using the expanded judging model to analyze the historical feedback records, the system finds the semantic 
support to refine the similarity measure and improve the retrieval performance. 

In [9], Mahmoudi et al. proposed the edge orientation autocorrelogram (EOAC) for shape -based image 
indexing and retrieval. It can be used for edge-based image indexing and retrieval without segmentation. The 
EOAC is invariant to translation, scaling, color, illumination, and small viewpoint variations, but it is not 
appropriate for texture-based images retrieval. Lowe [10] proposed a very effective algorithm, called scale- 
invariant feature transform (SIFT), in computer vision to detect and describe local features in images. It has 
been widely used in object recognition, robotic mapping and navigation, image stitching, 3D modeling, gesture 
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recognition, video tracking, etc. Banerjee et al. [11] proposed to use edge-based features for CBIR. The 
algorithm is computationally attractive as it computes different features with limited number of selected pixels. 

The texton co-occurrence matrices (TCM) proposed in [12] can describe the spatial correlation of textons for 
image retrieval. It has the discrimination power of color, texture and shape features. Kiranyaz et al. [13] 
proposed a generic shape and texture descriptor over multi-scale edge field for image retrieval, which is the so 
called 2-D walking ant histogram (2D-WAH). As a shape descriptor, it deals directly with natural images 
without any segmentation or object extraction preprocessing stage. When tuned as a texture descriptor, it can 
achieve good retrieval accuracy especially for directional textures. 

Luo et al. [14] developed a robust algorithm called color edge co-occurrence histogram (CECH), which is 
based on a particular type of spatial- color joint histogram. This algorithm employs perceptual color naming to 
handle color variation, and pre-screening to limit the search scope (i.e. size and location) of the object. A simple 
and interactive method which shows effectiveness in natural image retrieval using relevance feedback through 
the generation of trees based on the Multitexton Histogram descriptor - MTH [18]. MTH descriptor analyzes the 
spatial correlation between neighboring color and edge orientation based on four special texton [15] types. 
Because in the natural scenes the color and texture have close relationship in terms of fundamental elements and 
they are considered as atoms for pre-attentive human visual perception. RF confronts the subjectivity of humans 
in perceiving visual content and also eliminates the gap between high-level semantics and low level features, 
which are often used for content description and modeling [16], [17]. 

Guang-Hai Liu et. al, [18] presented a novel image feature representation method, called multi-texton 
histogram (MTH), for image retrieval. MTH integrates the advantages of co-occurrence matrix and histogram 
by representing the attribute of co-occurrence matrix using histogram. The MTH method is based on Julesz's 
textons theory, and it works directly on natural images as a shape descriptor. Meanwhile, it can be used as a 
color texture descriptor and leads to good performance. MTH can represent both the spatial correlation of 
texture orientation and texture color based on textons. MTH is well suited for large-scale image dataset retrieval. 

III. PROPOSED SYSTEM 

Fig. 1 . shows the general scheme of the proposed system using relevance feedback in CBIR. The basic idea of 
relevance feedback is to shift the burden of finding the right query formulation from the user to the system. 

Algorithm for proposed system is as follows: 

Stepl: The user gives query image to the proposed system. 

Step2: On given query image the system is first extract the low-level features of the image using the 

microstructure descriptor. 

Step3: System will transform the low-level features to high-level features of the image will represents using the 

Multitexton histogram descriptor technique which is discussed in the following sections. 

Step4: On the result of step 3, Performs the similarity measure of given image with the images in the image 

database. 

Step5: Based on the similarity measure values the set of images will be visualized. 

Step6: This set is supplied to the next phase of the proposed system i.e. relevance feedback, here the system 

represents the knowledge of the image. From it finds the high-level semantics of image. 

Step7: On the result of step 6 is the image set displays to the user. 

Step8: On this set user will give the feedback whether retrieved images are relevant or irrelevant to the query 

image. If images are relevant to the query image then system will be terminate and gives the final set of result to 

user. Else, repeat the steps from step 2 to step 7 until user satisfies with the result. 

The proposed system architecture consists of two main phases 
(i) Query Processing Phase, 
(ii) Relevance Feedback. 
(i) Query Processing Phase (QPP): In this step, the system will extract the low-level features from the image 
using the Micro Structured descriptor (MSD) and then on it, applies the Multilayer Neural network (MNN) to 
transform the high-level features of query image and features of an image will represents with Multitexton 
Histogram descriptor. After this, it is given to the next step to perform the similarity measure in between the 
queried image and images in the database, according to the similarities the system will automatically gives the 
ranks to the image. So that the user can easily finds the most relevant images by checking the ranks given by the 
system. In this step our proposed system, gives the promising results (most relevant images) to the user. 

(ii) Relevance Feedback (RF): This step divided into two main parts. The first part is knowledge representation 
and second is high-level semantics. Using these two steps, the system will be transforms the low-level features 
into high-level vectors. 
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Fig.l. Proposed Relevance Feedback system architecture 
The following sections describes in detail about important steps of proposed system. They are as follows: 

3.1 MICROSTRUCTURE DESCRIPTOR (MSD) 

The contents in digital images can vary significantly so that directly comparing them is infeasible for 
applications such as image retrieval. However, the local structures of images from the same class (e.g. textile, 
mountains, etc.) often show a certain amount of similarity. The structural approach assumes that texture is 
formed with simple primitives called "texels" (texture elements) by following some placement rules. For 
example, the local binary pattern [19] can be considered as a type of texture elements. A typical example of 
"texels" is Julesz's textons theory [20] [21], but it emphasizes on regular texture images. To address this 
problem, the concept of micro-structures is proposed in this paper for image retrieval. In some sense, we may 
think that the meaningful content of natural images is composed of many universal micro-structures. Therefore, 
if we could extract these micro -structures and describe them effectively, they can serve as common bases for the 
comparison and analyses of different images. This is the essential idea of this paper, and we call the proposed 
technique micro-structure descriptor (MSD). 

3.2 MICROSTRUCTURED IMAGE 

Natural scenes are rich in color, texture and shape information and it can be considered as a mosaic of 
regions with different colors, textures and shapes. In general, textons are defined as a set of blobs or emergent 
patterns sharing a common property. Here, in fig. 2. micro-structures are defined as the collection of certain 
underlying colors which have similar or the same edge orientation in uniform color space that combines color, 
texture and shape cues as a whole. In order to find the micro-structures, which have similar attributes such as 
edge orientation and color distribution, we partition the image into many small blocks, which can be a grid of 
size 2x2, 3x3, 5x5, 7x7 and so on. For the convenience of expression, the 3x3 block is used in the following 
development of microstructure analysis. In the 3x3 block, if one of the eight nearest neighbors has the same 
value as the center pixel, then it is kept unchanged; otherwise it is set to empty. 
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Fig. 2 (a) edge orientation map; (b)-(c) show the microstructure detection process; and (d) shows the detected microstructure 

In Fig. 3. it shows the illustration of micro-structure map extraction process. In Fig.4. shows the steps to 
Microstructure image formation. In Fig. 5. shows the two examples of an MSD: (a) sailing ship and (b) bus. 
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3.3 SIMILARITY MEASURE 

Suppose there is an edge orientation map y(x, y) of size WxN. When we move the 3x3 block from left-to-right 
and top-to-bottom throughout the image, the detected fundamental microstructures in a neighborhood can 
overlap. 

The following steps are used to obtain the micro structured image: 

(1) Starting from the origin (0,0), we move the 3x3 block from left-to-right and top-to-bottom throughout the 
edge orientation image y(x,y) with a step-length of three pixels along both horizontal and vertical directions. 
Then, we will obtain a micro-structure map, denoted by Ml(x,y), where 0<x <W-1, 0<y<N-l. 

(2) Starting from the location (1,0), we move the 3x3 block from left-to-right and top-to-bottom with a step- 
length of three pixels along both horizontal and vertical directions. Then, a micro -structure map M2(x,y) is 
obtained, where l<x<W-l, 0<y<N-l. 

(3) Similarly starting from location (0, 1), we can have the third micro -structure map M3(x,y), where 0<x<W-l, 
l<y<N-l. 

(4) Starting from location (1, 1), we can have the fourth microstructure map M4(x,y), where l<x<W-l, l<y<N- 
1. 

(5) The final micro-structure map, denoted by M(x,y), is obtained by fusing the four maps based on the 
following rule: M(x,y)=Max{Mi(x,y),M 2 (x,y),M 3 (x,y),M 4 (x,y)} 

3.4 MULTITEXTON HISTOGRAM DESCRIPTOR (MTH) 

Liu et al. [22] [23] presents a novel image feature representation method, called multitexton histogram 
(MTH), for image retrieval. MTH integrates the advantages of co-occurrence matrix and histogram by 
representing the attribute of co-occurrence matrix using histogram. It can be considered as a generalized visual 
attribute descriptor but without any image segmentation or model training, a new descriptor namely multi- 
texton histogram - MTH, for natural image retrieval which represents the spatial correlation between 
neighboring color and edge orientation based on textons [15] MTH integrates the advantages of co-occurrence 
matrix and histogram. Natural scenes are usually rich in both color and texture, and a wide range of natural 
images can be considered as a mosaic of regions with different colors and textures [18]. For that reason in these 
natural scenes the Color and texture have close relationship in terms of fundamental elements and they are 
considered as atoms for pre-attentive human visual perception [26] [27]. 
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Fig. 3. Illustration of microstructure map extraction process 
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Fig.4. Micro-structure image formation. 
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Fig. 5. Two examples of an MSD: (a) sailing ship and (b) bus. 



In this sense MTH represents an image by two matrices to analyze independently the spatial correlation 
between neighboring color and edge orientation based on four special texton types which are represented by 
four grids as shown in Fig. 6 (a). To each texton if the two pixels highlighted have the same value, then the grid 
will form a texton. Fig. 6 ((b), (c), (d) and (e)) shows the four special texton types of the proposed MTH. 

The working mechanism of texton detection is illustrated in Fig. 4. The 2x2 grids are moved from left-to-right 
and top-to-bottom throughout the edge orientation index image to detect textons with 2 pixels as the step-length. 
If a texton is detected, the original pixel values in the 2x2 grids are kept unchanged. Otherwise it will have zero 
value. Finally, the texton image is obtained, Fig. 6(e), which is used such as a mask to detect textons in the color 
index image. 
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Fig.6. Four texton types (a) 2x 2 grids, (b) Tl, (c) T2, (d) T3, (e) T4 

In order to get the spatial correlation between neighboring color and edge orientation based on textons, MTH 
considers four orientations 0%, 90%, 45% and 135% with one of distance. Then to each orientation MTH forms 
a feature vector with 82 dimensions, where the first 64 dimensional vectors represent the spatial correlation 
between neighboring texture orientation by using color information and the last 18 dimensional vectors 
represent the spatial correlation between neighboring colors by using the texture orientation information. 
Finally, MTH descriptor calculates the average of the histograms to four different orientations. The Fig. 5 shows 
an example of an original image with this MTH descriptor. 

The working mechanism of texton detection is illustrated in Fig. 7. In the color index image C(x, y), we 
move the 2X2 block from left-to-right and top-to-bottom throughout the image to detect textons with 2pixels as 
the step-length. If a texton is detected, the original pixel values in the 2X2 grids are kept unchanged. Otherwise 
it will have zero value. Finally, we will obtain a texton image, denoted by T(x,y). The four texton types used in 
MTH contain richer information than those in TCM because the co-occurring probability of two same-valued 
pixels is bigger than that of three or four same-valued pixelsina2X2 grid. As for the texton detection procedure, 
MTH is also faster than TCM. In the texton detection of TCM, the 2X2 grid moves throughout the image with 
one pixel as the step-length, and the detected textons in a neighborhood may overlap. The final texton image 
needs to be fused by the overlapped components of textons, and this will increase the computational complexity. 
Therefore, in this paper the step- length is set to two pixels to reduce the computational cost. 

3.5 RELEVANCE FEEDBACK 

The Relevance Feedback is a technique that has been used quite successfully in human computer interaction; 
because of it allows users express better their needs in the specification of a query. In order to achieve this, after 
a user submits a query by a given sample, the system 
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will return a set of similar images to the user. The returned images may not be fully relevant to the user's 
targets. In order to learn the query concept of the user, relevance feedback is engaged as a query refinement 
technique for helping the retrieval task. The relevance feedback [24] mechanism solicits the user to mark the 
relevance on the retrieved images and then refines the results by learning the feedbacks from the user, at the 
same time that the system adjusts the low-level features of the images. The relevance feedback procedure is 
repeated again and again until the targets are found [28]. 
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Fig.7. Mechanism of texton detection to get the texton matrix 





Fig. 8. MTH Descriptor to the image of elephant in his habitat (a) original image (b) MTH descriptor 
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A typical scenario for relevance feedback in content -based image retrieval is as follows: [29]: 
i. Machine provides an initial retrieval results, through query-by-keyword, sketch, or example, etc. 
ii. User provides a judgment on the currently displayed images as to whether, and to what degree, they are 
relevant or irrelevant to her/his request, 
iii. Machine learns and tries again. Go to step 2. 

This process is illusttated in Fig. 10. 
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Fig. 10. A typical Relevance Feedback process 

Within the traditional approach to relevance feedback are for example the method of the readjustment of 
weights of the feature query vector [17], where the user has to precisely decompose his information need into 
different feature representations and precisely specify all the weights associated with them, the proposed 
interactive approach allows the user to submit a coarse initial query and continuously refine his information 
need via relevance feedback. 

Another classic approach is the query point movement method (QPM), which aggregates the positive 
examples as a new query point at each feedback. After several a forceful changes of locations and contours, 
query point can be very close to a convex region of user's interest. This method is based on the Rocchio's 
formula which has been used in numerous applications [30]. 

IV. IMPLEMENTATION DETAILS 

The Query Processing Phase discussed above has been implemented which involves the generation of 
Micro Structured Image based on the given query. The implementation includes. 

4.1 COREL DATASETS 

This section describes the test done based on the proposed model and optimization obtained by the model. 
Corel dataset has become a default standard in demonstrating the performance of CBIR systems. In this section, 
we evaluate the performance of our method by using Corel dataset. All Corel images contain around 500 
images. Corel image database contains a large amount of images of various contents ranging from animals and 
outdoor sports to natural scenarios. 

4.2 DISTANCE MEASURE 

For each template image in the dataset, an M-dimensional feature vector T= [Tl, T2, y, TM] is extracted 
and stored in the database. Let Q=[Q1, Q2, y, QM] be the feature vector of a query image, the LI distance 
between them is simply calculated as 

M 

D(T,Q)= £m-!2il (1) 

i=i 

4.3 PERFORMANCE EVALUATION METRICS 

The precision and recall indices are used to evaluate the performance of the proposed method. The two 
indices are the most commonly used measurements for evaluating image retrieval performance. Precision is the 
ratio of the number of retrieved similar images to the number of retrieved images, while recall is the ratio of the 
number of retrieved similar images to the total number of similar images. They are defined as follows: 
P=I N /N " (2) 

R= I N /K (3) 

where I N is the number of similar images retrieved, 
N is the total number of images retrieved and 
K is the total number of similar images. 
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IV. CONCLUSION 

In this paper we proposed a new method to reduce the semantic gap in CBIR using relevance feedback image 
retrieval based on Multitexton Histogram descriptor. To deal with the long iteration problem of CBIR, can be 
reduced using Micro Structure Descriptor .In summary, the main feature of Relevance Feedback is to efficiently 
optimize the retrieval quality of interactive CBIR. Initially, the feature representation done with Multitexton 
Histogram, then generated micro structured image is used in retrieving images from the database. Later, from 
the retrieved image set user feedback is given for a pattern -based search to match the user's intention. Finally 
the most relevant set of images are retrieved from the database. As a result, traditional problems such as visual 
diversity and exploration convergence are solved. 
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Abstract — This paper proposes an adaptive calibration circuit 
for flow measurement technique by ultrasonic transducers using 
an Optimized Artificial Neural Network (OANN). The objectives 
of the present work are (i) to extend the linearity range of 
measurement to 100% of full scale input range, (ii) to make the 
measurement technique adaptive to variations in (a) pipe 
diameter, (b) liquid density, (c) liquid temperature, and (iii) to 
achieve objectives (i) and (ii) by using an optimized neural 
network. The output of ultrasonic transducer is frequency. It is 
converted to voltage by using a suitable data conversion unit. A 
suitable optimal ANN is added, in place of conventional 
calibration circuit, in cascade to data conversion unit. ANN is 
trained, and tested with data considering variations in pipe 
diameter, liquid density, and liquid temperature. Results show 
that the proposed technique has fulfilled the desired objectives. 

Keywords- Artficial Neural Network; Calibration; 
Linearization; Ultrasonic flow sensor; Sensor modelling 



I. 



Introduction 



Flow measurement has evolved over the years in response 
to measure new products, measure old products with new 
condition of flow, and for higher accuracy requirement as the 
value of fluid has gone up. Flow measurement is the 
quantification of fluid movement. Flow can be measured in a 
variety of ways, may be by contact type or non-contact type of 
sensor. Positive-displacement flow meters accumulate a fixed 
volume of fluid and then count the number of times the 
volume is filled to measure flow. Other flow measurement 
methods rely on forces produced by the flowing stream as it 
overcomes a known constriction, to indirectly calculate flow. 
Flow may be measured by measuring the velocity of fluid over 
a known area. Accurate flow measurement is not only an 
essential and vital requirement from qualitative view point but 
also highly essential from economic aspect. Among the non 
contact type of flow measurement, ultrasonic flow 
measurement is widely used method to measure flow, because 
of its advantage, like, high resolution, less interference of 
noise on output. However, non linear characteristics of 
Ultrasonic Flow Meter (UFM) have restricted its use. 

Literature review suggests that several techniques are 
adopted to calibrate the UFM. In [1], linearization of UFM is 
discussed using look up table designed on a processor. In [2], 
calibration of UFM was carried on with the help of neural 
network algorithm. In [3-5], [13] calibration of flow meter 



using several hardware circuits is discussed. In [6], calibration 
of flow meter using curve fit algorithm of Lab VIEW is 
discussed. In [7], [10] Calibration of UFM and compensation 
of temperature effect is discussed using neural network 
algorithm. Temperature compensation is also done by using a 
redundant sensor. In [8], calibration of flow sensor is 
discussed using fuzzy algorithms. In [9], calibration of flow 
sensor is discussed using polynomial equations. In [11], 
calibration of flow sensor is discussed using neural network 
algorithms. In [12], calibration is discussed using curve fitting 
polynomial equations. In [14], calibration of UFM and making 
it independent of pipe diameter, liquid density and temperature 
is discussed using Levenberg Marquardt Algorithm (LMA) of 
ANN is discussed. In [15], effect of liquid temperature on 
flow measurement is discussed. Reference [16] shows the 
effect of liquid density on flow measurement. In [17], effect of 
physical dimensions of flow meter is discussed. 

To overcome the restriction faced due to nonlinear 
response characteristics of the ultrasonic flow meter, several 
techniques, as stated above, have been reported. But some of 
these are tedious and time consuming. Further, the process of 
calibration needs to be repeated or calibration circuit need to 
be replaced/ tuned whenever there is a change in pipe diameter 
or liquid density. The problem of nonlinear response 
characteristics of a UFM further aggravates when there is a 
change in liquid temperature, since the output of an UFM is 
also dependent on temperature. 

An intelligent flow measurement technique is proposed in 
this paper which is made linear for the full scale input range 
and adaptive of variations in pipe diameter, liquid density, and 
liquid temperatures. The paper is an extended version of [14]. 
The extended version uses an optimized ANN instead of an 
arbitrary scheme and algorithm of ANN as considered in the 
original paper. The optimized ANN is achieved considering 
five different algorithms and schemes and comparing their 
MSE and Regression. The ANN model which gives least MSE 
and Regression close to one is considered as optimized ANN. 
Further, in this extended version, the proposed technique is 
subjected to practical data for validation. 

The paper is organised as follows: after introduction in 
Section-I, a brief description on ultrasonic flow meter is given 
in Section-II. The output of the ultrasonic flow meter is 
frequency; a brief discussion on data conversion i.e. a 
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Frequency to voltage converter is discussed in Section-Ill. 
Section-IV deals with the problem statement followed by 
proposed solution in Section-V. Result and discussion is given 
in Section-VI, and the paper is ended with conclusion and 
future scope of work in Section- VII. 



II. 



Ultrasonic flowmeter 



Ultrasonic flow meters have gained a lot of attention over 
the past years, primarily because of their ability to measure 
custody transfer of natural gas. They are replacing differential 
pressure and turbine flow meters in many natural gas 
applications. UFMs are also widely used to measure liquid 
flow. This is not limited to clean liquids only. UFM can 
accurately measure the flow of slurries and liquids with many 
impurities 

Ultrasonic flow meters are one of the most interesting 
types of meters used to measure flow in pipes. The most 
common variety, transit time, has both a sending and a 
receiving transducer. Fig. 1 shows the arrangement of one 
such UFM. Both sending and receiving transducers are 
mounted on either side of the flow meter, or pipe wall. The 
sending transducer sends an ultrasonic signal at an angle from 
one side of the pipe which is received by the receiving 
transducer. The flow meter measures the time that the 
ultrasonic signal takes to travel across the pipe in forward and 
reverse direction. When the signal travels along the direction 
of the flow, it travels more quickly compare to the condition 
of no flow. On the other hand, when the signal travels against 
the direction of flow, it slows down. The difference between 
the "transit times" of the two signals is proportional to flow 
rate [18], [19]. 
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Pipe j_ Ttl own j 

Figure 1 . Arrangement of Ultrasonic flow meter 

For Fig.l, we have 

™ D 

M* 



U P C +vsine 



X 



M*- 



down 



AT = T„ 



C o -vsin0 



(1) 



(2) 



T H 



'up 'down (3) 

Frequency, f IN = 1/AT 
where: 

M - No of times ultrasonic signal travels in forward/ 

backward direction 
C - Velocity of ultrasonic signal in static fluid 
D - pipe diameter 




v - velocity of fluid 

The velocity of ultrasonic signal depends on [14], [16], 
[19], [20] density of liquid as, 



(4) 



with 

k - bulk modulus and 
p - Density of liquid 

Effect of temperature on density [14-16], [19-21] can be given 
by 

p =r — ^ — i/ri- (Ptl " Pto) i 

where: 



(5) 



Pi - specific density of liquid at temperature t, 

p - specific density of liquid at temperature t 

P,i - pressure at temperature tj 

P t0 - pressure at temperature t 

E- Modulus of elasticity of the liquid 

a - temperature coefficient of liquid 

III. Data conversion unit 

The block diagram representation of the proposed 
technique is given in Fig. 2. 
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Figure 2. Block diagram of the proposed flow measurement technique 

The LM2917 series are monolithic frequency to voltage (f- 
V) with a high gain op amp/comparator designed to operate a 
relay, lamp, or other load when the input frequency reaches or 
exceeds a selected rate [22] . 

The op amp/comparator is fully compatible with the 
sensors and has a floating transistor as its output. The collector 
may be taken above V cc up to a maximum V CE of 28 V. 



Vqut — flN * R * C * V cc 



(6) 



IV. Problem statement 



In this section, characteristic of ultrasonic flow meter is 
simulated to understand the difficulties associated with the 
available measurement technique. For this purpose, simulation 
is carried out with three different pipe diameters. These are D 
= 0.1 m, 0.2 m, and 0.3 m. Three different liquid densities are 
considered. These are p = 500 Kg/m 3 , 1000 Kg/m 3 , and 1500 
Kg/m 3 . Three different liquid temperatures are considered. 
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These are t = 25 °C, 50 °C, and 75 °C. The output frequency of 
UFM is calculated using eqn. 3, 4, and 5, with respect to 
various values of input flow considering a particular pipe 
diameter, liquid density, and temperature. These output 
frequencies are used as input of frequency to voltage converter 
circuit and by using eqn. 6, the output voltages are generated. 
The MATLAB environment is used of and the following 
characteristics are found. 
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Figure 3. Output of data conversion unit for various flow rates and 
temperatures when liquid density is 500 Kg/m and pipe diameter is 0. 1 m 
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densities when temperature is 50 °C and pipe diameter is 0.3 m 
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Figure 5. Output of data conversion unit for various flow rates and pipe 
diameters when liquid density is 1500 Kg/m 3 and temperature is 75 °C 



Fig. 3, Fig.4, and Fig. 5 show the variation of voltages with 
the change in flow rate considering different values of pipe 
diameter, liquid density, and temperature. It has been observed 
from the above graphs that the output from the frequency to 
voltage converter circuit has a non linear relation. Datasheet of 
ultrasonic suggests that of 10% to 50% of full scale input 
range is used in practice. The output voltage varies with the 
change in pipe diameter, liquid density, and temperature. 
These are the reasons which have made the user to go for 
repeated calibration techniques using some circuits. These 
conventional calibration techniques have drawbacks that these 
are time consuming and need to be recalibrated whenever pipe 
diameter, and/or liquid density, and/or liquid temperature is 
changed in the system. Further the use is restricted only to a 
portion of full scale of input range. 

To overcome these drawbacks, this paper makes an 
attempt to design a flow measurement technique using UFM 
incorporating intelligence to produce linear output for the full 
range of input scale and to make the system adaptive of 
variation in pipe diameter, liquid density, and temperature 
using neural network taking an optimized ANN model. 

Problem statement: given an arrangement for 
measurement of flow consisting of UFM in cascade with 
frequency to voltage converter circuit, design an intelligent 
non contact flow measurement technique using optimized 
neural network model and having the following properties: 



i. 
ii. 
Hi. 

XV, 

v. 
vi. 



Adaptive to variations in diameter of the pipe. 

Adaptive to variations in liquid density. 

Adaptive to variations in liquid temperature. 

Output bears a linear relation with the input flow rate 

Full scale input range can be measured. 

All using an optimized ANN. 



V. Problem solution 

The drawbacks discussed in the earlier section are 
overcomed by adding an optimized ANN model in cascade 
with data converter unit replacing the conventional calibration 
circuit. This model is designed using the neural network 
toolbox of MATLAB. 

The first step in developing a neural network is to create a 
database for its training, testing, and validation. The output 
voltage of data conversion unit for a particular flowrate, pipe 
diameter, liquid temperature, and liquid density is stored as a 
row of input data matrix. Various such combinations of input 
flowrate, pipe diameter, liquid temperature, liquid density, and 
their corresponding voltage at the output of data conversion 
unit are used to form the other rows of input data matrix. The 
output matrix is the target matrix consisting of data having a 
linear relation with the input flowrate and adaptive of 
variations in pipe diameter and/or liquid density and/or liquid 
temperature as shown in Fig 6. 
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Flow rate in cubic meter per second * 10 



Figure 6 Target graph 

The process of finding the weights to achieve the desire 
output is called training. The optimized ANN is found by 
considering different algorithms in first step and then by 
varying the transfer functions of neurons in second step to find 
an optimized structure with minimum number of hidden layers 
subject to MSE less than the predefined value. MSE is the 
average squared difference between outputs and targets. 
Lower values of MSE are better. Zero means no error. 

Four different algorithms with back propagation scheme 
are used to find the optimized ANN in first step. These are 
Back Propagation (BP) trained by Ant Colony Optimization 
(ACO) [23 - 25], Back Propagation trained by Artificial Bee 
Colony (ABC) [26], [27], Back Propagation (BP) trained by 
Genetic Algorithm (GA) [28 - 30], and Back Propagation 
trained by Particle Swarm Optimization (PSO) [31], [32]. 
Training of ANN is first done assuming only one hidden layer. 
MSE values are noted. Hidden layer is increased to 2 and 
training is repeated. This process is continued up to 5 hidden 
layers. In all cases MSE are noted and shown in Table 1. 
MSE's corresponding to different algorithms and number of 
hidden layers is shown pictorially in Fig. 6. Table. 1 and Fig. 6 
very clearly shows that BP trained by PSO yields most 
accurate results. BP trained by PSO with 1 hidden layer is 
considered as the most optimized ANN for desired accuracy of 
result. Details of the optimized neural network is given in 
Table-3 

TABLE I. COMPARISON OF MSE FOR NEURAL NETWORK MODELS 



Lay 
ers 


BP_ ACO 


BP_ ABC 


BP_GA 


BP_PSO 


1 


1.00E-4 


8.68E-5 


6.28E-5 


4.11E-5 


2 


5.84E-7 


3.47E-7 


1.08E-7 


9.12E-8 


3 


1.85E-9 


9.12E-10 


7.85E-10 


5.89E-10 


4 


1.11E-11 


8.61E-12 


6.38E-12 


4.85E-12 


5 


2.58E-13 


1.01E-13 


7.96E-14 


5.47E-14 


6 


1.01E-15 


8.67E-16 


5.22E-16 


3.87E-16 



Different transfer functions of neuron are used in literature. 
In second step training, testing, and validation are repeated 
with ten different transfer functions of neuron on the 
optimized ANN. BP_PSO with one hidden layer obtained in 



the first step. The effect of neuron TFs in terms of MSE are 
noted and tabulated in Table-2. Axon TF is finally used in the 
optimized ANN based on the outcome of second step, as 
shown in the Table-2. 

Table II. Comparison of different neuron transfer function 



Sl.no 


Transfer function 


MSE 


1. 


Tanh 


1.38E-5 


2. 


Sigmoid 


1.02E-5 


3. 


Linear Tanh 


8.87E-6 


4. 


Linear sigmoid 


7.92E-6 


5. 


Softmax 


3.08E-6 


6. 


Bias 


2.08E-5 


7. 


Linear 


4.11E-5 


8. 


Axon 


2.72E-6 


9. 


Tansig 


5.87E-6 


10. 


Logsig 


3.99E-6 




No of hidden layers 



1 1 



Figure 6. Mesh of variation of MSE with algorithm and hidden layer 



Table III. Details of optimized neural network model. 



Optimized parameters of the neural networks model 


Database 


Training base 


100 


Validation base 


33 


Test base 


33 


No of neurons in 


1 st layer 


10 


Transfer 
function of 


1 st layer 


Axon 


Output layer 


linear 


3 




Flow in 

m 3 /s 


Din 

m 


Density in 
Kg/m 3 


Temp in 

°C 


min 


0.000 


0.1 


500 


25 


max 


0.0025 


0.3 


1500 


75 
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VI. Results and conclusion 

The proposed optimized ANN is trained, tested, and 
validated with the simulated data. Once the training is over, 
the flow measurement system is subjected to various test 
inputs corresponding to different flow rates at a particular pipe 
diameter, liquid density, and temperature, all within the 
specified range. For testing purposes, the range of flow is 
considered from to 0.0025m /s, the range of pipe diameter is 
0.1 to 0.3 m, the range of liquid density is 500 to 1500 Kg/m 3 , 
and temperature ranges from 25 °C to 75 °C. The outputs of the 
proposed technique with optimized ANN are noted 
corresponding to various input flow rate with different values 
of pipe diameter, liquid density, and temperature. The results 
are listed in table-4. 

Table IV suggests that measured flow rates are same as 
actual flow rate. Mean square of % error for 15 different 
simulated test conditions is 0.0403. It may be noted that the 
test conditions in Table IV are different from the training data 
set. Table IV show that the proposed system is made adaptive 
of variations in pipe diameter, liquid density, and liquid 
temperature, and thus repeated calibration can be avoided. 

Available reported works have discussed different 
techniques for calibration of flow measurement, but these are 
not adaptive of variations in pipe diameter, liquid densities, 
and temperatures. Hence, repeated calibration is required for 
any change of these parameters. Some time the calibration 
circuit may itself be replaced which is a time consuming and 
tedious procedure. Further, most of the reported works have 
not utilized the full scale of measurement. In comparison to 
these, the proposed flow measurement technique achieves 
linear input output characteristics for full scale input range and 
makes the output adaptive of variations in pipe diameter, 
liquid density, and temperature. All these have been achieved 
by using an optimized ANN model. Four different models are 
considered for this purpose. BP trained by PSO algorithm with 
neuron transfer function as axon, is found to have achieved the 
desired MSE with only one hidden layers. This is in contrast 
to an arbitrary ANN used in most of the earlier reported 
works. 
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Table V. shows the results of proposed technique for various input conditions 



Actual flow in 
m 3 /s 


Pipe dia in 

m 


Liquid density in 

Kg/m 3 


Temp in 

°C 


Data conver 
o/p in V 


ANN o/p in 
V 


Measured flow 
in m/s 


Error in % 


0.0005 


0.15 


500 


75 


3.65 


0.9985 


0.0004992 


0.160 


0.0005 


0.20 


600 


70 


3.97 


1.0025 


0.0005012 


-0.240 


0.0005 


0.25 


700 


65 


4.13 


0.9992 


0.0004986 


0.280 


0.0010 


0.30 


800 


60 


1.09 


1.9987 


0.0009996 


0.040 


0.0010 


0.10 


900 


55 


1.18 


2.0062 


0.0010035 


-0.350 


0.0010 


0.125 


1000 


50 


1.43 


2.0084 


0.0010012 


-0.120 


0.0015 


0.175 


1100 


45 


0.41 


2.9967 


0.0014991 


0.060 


0.0015 


0.225 


1200 


40 


0.55 


2.9986 


0.0014988 


0.080 


0.0015 


0.275 


1300 


35 


0.63 


3.0081 


0.0015022 


-0.147 


0.0020 


0.325 


1400 


30 


0.50 


4.0068 


0.0020018 


-0.090 


0.0020 


0.075 


1500 


25 


0.88 


4.0031 


0.0020029 


-0.145 


0.0020 


0.050 


550 


20 


0.51 


3.9991 


0.0019988 


0.060 


0.0025 


0.20 


650 


70 


0.61 


4.9993 


0.0024998 


0.008 


0.0025 


0.10 


750 


33 


0.80 


5.0029 


0.0025003 


-0.012 


0.0025 


0.30 


850 


48 


0.69 


5.0051 


0.0025008 


-0.032 
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Abstract: 

With the tremendous growth of the web it becomes 
hard to cope up with the increasing demand of 
information on web, up to the level of user's 
expectation. Users always want most appropriate 
results. For this purpose Search Engines use Page 
Ranking Algorithms. Ranking Algorithms line up the 
results according to the user priority. This paper 
deals with "Page Rank Algorithm " which is used in 
the most famous search engine "Google". Our 
proposed algorithm acts as a layer onto the search 
engine which in turn gives better results to the user 
by disambiguating the query. To prove that our 
algorithm improves the results we have used 
Precision and Recall measures and compared both 
the algorithms on these measures. 

Paper description: 

A new Ranking algorithm is proposed in this paper 
which is a way better than the majorly used Page 
Rank algorithm. 

Keywords: 

Information retrieval, Page Rank Algorithm, 

Precision, Ranking Algorithm, Recall 

I. INTRODUCTION 

"World Wide Web" these words are embedded 
into our lives to such an extent that we cannot 
even think of separating them from ourselves. 
And if we do so it will affect our lives largely. 
Today, with such an increasing demand of 
information and network building, search 
engines have served a great purpose. With them 
information access has become a work of 
seconds. But the question arises whether these 
Search Engines are providing results according 
to user demand / perspective or they are giving 
just an another generalized results? To some 
extend we can say they are just giving 
generalized result on the user queries. For e.g. 
when a user enters "Cricket" Search engine will 



result the web pages related to sports as well as 
the insect. But it doesn't identify the need of the 
user i.e. if user wants to search for cricket 
(insect) or cricket (sports). In this paper we are 
going to propose a framework which resolves 
such type of ambiguities. 

II. LITERATURE REVIEW 

Information retrieval is the activity of obtaining 
information resources relevant to an information 
need from a collection of information resources. 
Searches can be based on metadata or on full- 
text indexing. Today Search Engines are highly 
used for the purpose of Information Retrieval 
and with such a high demand it becomes 
essential to refine the results and present user 
with the most appropriate top ranked results. 
Web Mining is the field which deals with such 
concepts. R. Cooley, B. Mobasher and J. 
Srivastava [9] proposed the definition of Web 
Mining, its taxonomy. They also proposed 
architecture of Web Usage Mining. In July 2000, 
R. Kosala, H. Blocked [10] pointed some 
confusion regarding the usage of the term Web 
Mining and its categories. Besides this they 
explored the connection between the Web 
mining categories and their related agent 
paradigm. David Hawkin [2] gave the descriptive 
view of Web Search engines and its components. 
Eric J. Glover, Steve Lawrence, Michael D. 
Gordon, William P. Birmingham, C. Lee Giles 
[3] proposed a search engine which not only 
produces the relevant results but also allow users 
to provide preferences in the form of an 
information need category. With the help of such 
information search engine provides more 
valuable results than by considering only the 
query. 



Dept of CSE, PESIT, Bangalore South Campus 



52 



Proceedings of RETCOMP - 2013, Volume 1 



To provide users with the results ranking 
mechanism is used. Till now many Ranking 
Algorithms have been proposed. S. Brin and L. 
Page [7] [11] proposed the ranking algorithm 
"Page Rank Algorithm" which is used in most 
popular search engine Google which prioritize 
web pages according to its inbound links. They 
used the concept of citation analysis such that a 
link coming from an important page was given 
high weight whereas page which was not so 
important was given a low weight. Also they 
gave a formula to calculate the Page Rank of 
pages in an iterative manner. 

J.Kleinberg [5] [6] gave us algorithm "Hyperlink 
Induced Topic Search" (HITS). He recognized 
two different forms of web pages called hubs and 
authorities. Authorities are pages having 
important contents whereas Hubs are pages that 
act as resource lists guiding users to authorities. 
Thus a good hub page for a subject points to 
many authorities pages on that content, a good 
authority page is pointed by any good hub pages 
on the same subject. Kleinberg said that a page 



may be a good hub and good authority at the 
same time and this circular relationship lead to 
the definition of an iterative algorithm HITS. 
This algorithm was used in search engine 
"CLEVER" but was not successful because of 
topic drift and efficiency problems. Also this 
algorithm worked on both Web Structure Mining 
as well as Web Content Mining. 

Wenpu Xing and Ali Ghorbani [12] proposed 
"Weighted Page Rank" (WPR) algorithm which 
is an extension of Page Rank algorithm. This 
algorithm assigns a larger rank values to the 
more important pages rather than dividing the 
rank value of a page evenly among its outgoing 
linked pages. This algorithm is based on Web 
Structure Mining. 

III. WEB MINING 

Web Mining is a branch of Data Mining that 
automates the extraction of information from 
Web documents and services. Web Mining can 
be categorized into three as shown in figure 1 : 
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Figure 1. Categories of Web Mining 



Web Content Mining helps exploring useful and 
relevant information from the content of web. 
Web Structure Mining helps find structure that 
exists between different web pages. Web Usage 
Mining considers the usage pattern of the user to 
find out the relevant data. 

IV. RANKING ALGORITHMS 

Ranking Algorithms plays an important role in 
ordering the results of search engine according to 
the user priority. Different ranking algorithms 
work on different perspective. Some ranking 
algorithms work on link structure, some on 
content analysis and others on the usage pattern 
of user. With the differing nature of ranking 
algorithm, many ranking algorithms have been 
developed. To understand the basic nature and 



use following algorithms are discussed in the 
next section. 

A. PAGE RANK ALGORITHM 

Page Rank Algorithm was developed by Brin 
and Page [7] [11] during their thesis at Stanford 
University. This algorithm is a graph based 
algorithm which uses link structure of web pages 
and considers back links for the calculation of 
ranks of web pages. 

For this purpose Brin and Page [7] [11] proposed 
a formula to calculate Page rank of a Page 'A' 
where Ti, T 2 ...T n are pages pointing to it. 
Formula is as follows: 



Paw = C i -«+*(=£ 



coy 






..(1) 
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Where, 

d, damping factor (whose value is generally 
0.85). It is used to stop other pages 
having too much influence) 
C (Tj), number of links going out of Ti 

PR (T,), Page rank of Page T ; 

Sum of page rank of all web pages corresponds 
to 1. It uses an iterative approach to calculate 
actual page rank of web pages initiating with 
page rank 1 for all web pages. 

B. WEIGHTED PAGE RANK 
ALGORITHM 

Weighted Page Rank algorithm was proposed by 
Wenpu Xing and Ali Ghorbani. [12] It is an 
extended version of Page Rank algorithm which 
decides the rank of pages on basis of both in 
links and out links of the pages. It doesn't divide 
the rank equally among all the outgoing links; 
instead it works on the basis of importance of 
Web pages. Wenpu Xing and Ali Ghorbani [12] 
gave the modified formula as: 



C. HYPERTEXT INDUCED TOPIC 
SELECTION (HITS) 

Kleinberg [5] [6] categorized web pages into two 
forms, hubs and authorities. Where authorities 
were the pages that contain important content or 
information specific to the query and hubs were 
the pages that contain links to authorities. Hence 
for a hub to be a good hub it should point to 
many authoritive pages on the specific content 
whereas a authority to be a good authority it 
should be pointed by many good hubs. Kleinberg 
said that pages may be a good authority and hub 
at the same time. This iterative relation formed 
the basis for HITS. In HITS there are two major 
steps: 

• Sampling: A set of relevant pages to the 
query are collected from the complete 
set of web pages. 

• Iterative Step: Finds hubs and 
authorities using formulas given below: 



H„ 



A P =1 



q£B{p)Hq- 



(5) 
.(6) 



PR{u) = (1 - d) + d 2 PR(y)W$ tt) W%h 
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Where, 

d, damping factor (whose value is generally 

0.85). It is used to stop other pages 

having too much influence) 
PR (Ti), Page rank of Page T ; 

(3) 
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w, 



(v,u) 



£ P €R(v) °p 

I„ 
IpeR(v) >p 



(4) 



/„ and I p represent the number of in links of 
page u and page p, respectively. 

O u and O p represent number of out links of 
page u and page p, respectively 

R (v) denotes the reference page list of page 
v. 



Where 

H p is hub weight of p 

A p is authority weight of p 

I(p) is set of reference pages 

B(p) is set of reference pages 

V. PROPOSED WORK 

Our proposed algorithm acts as a layer to the 
Page Rank Algorithm. It resolves the ambiguity 
of the query and rearranges the results as per the 
user requirements. For e.g. If user enters the 
query "Cricket" it may refer to Cricket as a sport 
or as a insect. Our algorithm resolves the 
ambiguity of these words and produces results 
according to the user preference. The framework 
of the proposed algorithm is given in fig 2. 
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Proposed 
Algorithm 



... 



Fig 2. Framework of Proposed Algorithm 



VI. RESULTS 

When the user enters a the word cricket, to be 
searched, Google results returns the pages both 
for cricket(sport) and cricket(insect) but the 
cricket(sports) pages are on the top priority and 
user need to traverse result pages for finding the 
cricket(insects). The results of Google are shown 
in fig 3. The results produced by our algorithm 



were much efficient as when we applied the 
measures on both the algorithms. We have 
compared both the algorithms (Page Rank and 
newly developed algorithm) on the basis of 
Precision and Recall measures. Precision is a 
measure of the ability of a system to present only 
relevant items. Recall is a measure of the ability 
of a system to present all relevant items. 
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Fig 4. Comparative graph for Precision 



Fig 5. Comparative graph for Recall 



Dept of CSE, PESIT, Bangalore South Campus 



56 



Proceedings of RETCOMP - 2013, Volume 1 



VII. CONCLUSION 

Different ranking mechanisms are used for 
different purpose out of which Page Ranking 
algorithm is most widely used. In this paper we 
have proposed an extension of Page Rank 
algorithm which helps to resolve ambiguities of 
search queries and provide appropriate results to 
users. Also we have compared both the 
algorithms (Page Rank algorithm and the 
proposed algorithm) on the basis of Precision 
and Recall measures. 
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Abstract 

The development of new imaging methods 
in various fields arises the need of meaningful 
combination of all available image datasets. Image 
fusion is the process that combines information from 
multiple images of the same scene without the 
introduction of distortion or loss of information. The 
aim of image fusion is to integrate complementary as 
well as redundant information from multiple images 
to create a fused image output. The new image 
generated should contain a more accurate description 
of the scene than any of the individual source images 
and is more suitable for human visual and machine 
perception or further image processing and analysis 
tasks. In recent years, image fusion has attracted 
increasing attention in a wide variety of applications 
such as concealed-weapon detection, remote sensing, 
intelligent robots, medical diagnosis, defect 
inspection, and military surveillance. This paper 
presents in detail the various categories, techniques 
and algorithms for image fusion with a comparative 
study. 

Keywords: Image fusion, image registration, spatial 
domain fusion, transform domain fusion, wavelet 
based fusion,PCA fusion. 

1.0 INTRODUCTION 

1.1 Image fusion 

Image fusion is the process that combines 
information from multiple images of the same scene, 
captured from different sensors. The main concept 
behind all image fusion algorithms is to detect strong 
salient features in the input sensor images and fuse 
these details to the synthetic image and the resulting 
image is referred as fused image. 

Let Xi(r).... x,(r) represent T images of size 
Mi x M 2 capturing the same scene, where r=(i,j) 
refers to pixel co-ordinates ( if) in the image. The 

process of combining the important features from the 
original T images to form a single enhance image y(r) 
* corresponding author 



is usually referred to as image fusion[20]. We 
categorize the image fusion according to the data 
entering the fusion and the fusion purpose [6]. 

1.2 Image fusion categories 

1.2.1 Multi view fusion 

It is the fusion of images from 
same modality which are taken at the same 
time but from different view points. 

7.2.2 Multi modal fusion 

It the fusion of images coming 
from different modalities such as PET, CT, 
MRI, visible, ultraviolet, infra red etc. 

1.2.3 Multi temporal fusion 

It refers to the fusion of images 
taken at different times in order to detect 
changes between them or to synthesize 
realistic images of objects which were not 
photographed in a desired time. 

1.2.4 Multi focus fusion 

It is the fusion of the images of 
3D scene taken repeatedly with various 
focal length. 
1.2.5 Fusion for image restoration 

It is of fusing two or more images 
of same scene and, each of them blurred and 
noisy may lead to a deblurred and denoised 
image. Multichannel deconvolution is a 
typical representative of this category. This 
approach can be extended to super 
resolution fusion where blurred images of 
low spatial resolution are fused to provide 
us a high resolution images. 

7.3 Basic stages of image fusion 

In each of the above said category, while 
fusing the images, due to the different sensor 
viewpoints, there might exist some mis- 
correspondence between several points of the 
observed scene .Therefore registration of images is 
done before fusing the images. Image Registration is 
the process of establishing point-by-point 



Dept of CSE, PESIT, Bangalore South Campus 



58 



Proceedings of RETCOMP - 2013, Volume 1 



correspondence between a number of images, 
describing the same scene. Image registration works 
usually in four steps. 
1.3.1. Feature detection 

The salient and distinctive objects 
(corners,line intersections, edges, contours, 
closed boundary regions etc.)are manually, or 
preferably automatically detected. 

1.3.2 Feature matching 

In this step, the correspondence 
between the features detected in the reference 
image is established. 

1.3.3 Transform model estimation 

Here, the type and parameters of the 
so called mapping functions, aligning the 
sensed image with the reference image are esti 
mated. 

1.3.4 Image re-sampling and transformation 

In this step,the sensed image is 
transformed by means of the mapping 
functions. Image values in non-integer co- 
ordinates are estimated by an appropriate 
interpolation technique. 

2.0 IMAGE FUSION TECHNIQUES 

There are various techniques that have been 
developed to perform image fusion [2]. They can be 
broadly classified into two groups 

1 . Spatial domain fusion techniques 

2. Transform domain fusion techniques. 

2.1. Spatial domain fusion technique. 

In this technique, the input images are 
fused in the spatial domain i.e using localized 
spatial features such as gradient, spatial frequency 
and local standard deviation. Assuming that g(.) 
represents the fusion rule, spatial domain fusion 
technique can be summarized as follows, 

Y(r) =g(xi(r), x 2 (r) x,(r)) 

The fusion process deals with the original 
pixel values [7]. Pixel level fusion uses a 
transformation on to perform data fusion. Some of 
the spatial domain fusion methods are, 

> Averaging Method 

> Brovey Transform method 

> PCA Method 

> IHS method 



2.7.7 Averaging method 

This is a basic and straight forward 
technique and fusion could be achieved by 
simple averaging corresponding pixels in 
each input images. Let the input images be 
Ii(x,y) and h(x,y) , then 



ijU,y) 



/i(.y,y) + /,(.v,,v) 
2 



2.7.2 Brovey transform based image fusion 

The basic procedure of the Brovey 
transform first multiplies each MS band by the high 
resolution PAN band, and then divides each product 
by the sum of the MS bands. The algorithm is shown 
in the equation below. 

DNFused = DNpan x DN hl / ( DN hl + 
DN b2 + DN b3 ) 
where DNfused means the digital number (DN) of 
the resulting fused image; DNbl, DNb2 and DNb3 
stand for pixel values of three bands of multiple 
spectral image; DNpan stand for pixel values of high 
resolution Pan band. 

2.7.3 Principal Component Analysis (PCA) based 
image fusion 

The PCA transform converts inter-correlated 
multi- spectral (MS) bands into a new set of 
uncorrelated components. To do this approach first 
we must get the principle components of the MS 
image bands. After that, the first principle 
component which contains the most information of 
the image is substituted by the panchromatic image. 
Finally the inverse principal component transform is 
done to get the new RGB (Red, Green, and Blue) 
bands of multi-spectral image from the principle 
components [2] . 



2.7.4 Intensity-hue-saturation (HlS)based 
fusion 



image 



The intensity-hue- saturation (HIS) fusion 
converts a color MS image from the RGB space into 
the IHS color space. The HIS components can be 
defined as follows: 

I=(R+G+B)/3 

H=(B-R) /3(I-R) , S=l-M, where G=MIN(R,G,B) 

H=(R-G) /3(I-G) , S=l-G/I, where G=MIN(R,G,B) 

H=(G-B)/3(I-B), S=l-B/I, where B=MIN(R,G,B) 
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where I,H,S stand for Intensity, Hue and 
Saturation components respectively; R, G, B mean 
Red, Green, and Blue bands of multi-spectral image. 

Because the intensity(I) band resembles a 
panchromatic (PAN) image, it is replaced by a high- 
resolution PAN image in the fusion. A reverse IHS 
transform is then performed on the PAN together 
with the hue (H) and saturation (S) bands, resulting in 
an IHS fused image. 

These image fusion methods are often 
successful at improving the spatial resolution, 
however, they tend to distort the original spectral 
signatures to some extent [5]. 

2.2 Transform domain fusion techniques. 

In this technique, source images 
are projected onto localized bases which are usually 
designed to represent the sharpness and edges of an 
image and therefore describe the image using a more 
meaningful representation that can be used to detect 
and emphasize the salient feature of images which 
are more clearly depicted than in the spatial domain. 

Let T{.} represents a transform operation 
and g(.) represents a transform rule. Transform 
domain fusion technique can be outlined as follows, 
Y(r)=T' fg(Tfxl(r)J, {x T (r)J) 

It was found that better results were 
obtained if the fusion was performed in the 
transform domain. The basic idea is to perform 
multi resolution decomposition on each source 
image, then integrate all these decompositions to 
form a composite representation, and finally 
reconstruct the fused image by performing an 
inverse multi resolution transform. 

The decade of 1980's saw the introduction 
of pyramid transform - a fusion method in the 
transform domain. An image pyramid [16] consists 
of a set of low pass or bandpass copies of an image, 
each copy representing pattern information of a 
different scale. At every level of fusion using 
pyramid transform, the pyramid would be half the 
size of the pyramid in the preceding level and the 
higher levels will concentrate upon the lower spatial 
frequencies. 

The basic idea is to construct the pyramid 
transform of the fused image from the pyramid 
transforms of the source images and then the fused 



image is obtained by taking inverse pyramid 
transform. Most commonly used pyramid transform 
fusion methods are 

1 . Laplacian Pyramid 

2. Filter Subtract Decimate Pyramid 

3. Morphological pyramid 

4. Ratio-of-low pass Pyramid 

With the development of wavelet theory , 
the multi-scale wavelet decomposition has taken the 
place of pyramid decomposition 11]. The wavelet 
theory was introduced as a mathematical tool 
in 1980s and it has been extensively used in image 
processing that provides a multi-resolution 
decomposition of an image in a biorthogonal 
basis and results in a non-redundant image 
representation. 

An alternative to fusion using pyramid 
based multi- resolution representations is fusion in 
the wavelet transform domain. The wavelet transform 
decomposes the image into low-high, high-low, high- 
high spatial frequency bands at different scales and 
low-low band at the coarsest scale[15]. 

Wavelet transform fusion schemes offer 
several advantages over similar pyramid based fusion 
schemes . 

(1) wavelet transform provides directional 
information while the pyramid 
representation does not introduce any 
spatial orientation in the decomposition 
process. 

(2) In pyramid based fusion, the fused 
images often contains blocking effects in 
the regions where the input images are 
significantly different. There are no such 
artifacts in similar wavelet based fusion 
results. 

(3) Images generated by wavelet image 
fusion have better signal-to-noise ratios 
than images generated by pyramid image 
fusion when the fusion rules are used. 

(4) When subject to human analysis 
also,wavelet fusion results are better 
perceived. 

The information flow in one level of 2-D 
image decomposition is illustrated in Fig. 3 and 3 
levels of decomposition is shown in fig 4. 

Wavelet separately filters and down 
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samples the 2-D image in the vertical and 
horizontal directions (separable filter bank). The 
input (source) image is I(x,y) filtered by low 
pass filter L and high pass filter H in horizontal 
direction and then down sampled by a factor 
of two (keeping the alternative sample) to 
create the coefficient matrices IL(x,y) and IH 
(x,y) . 
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Filtering 

row wise 
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Fig. 3. Information flow of one-level of 2-D 
decomposition 
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Fig. 4. 3 levels of 2-D decomposition 
The coefficient matrices IL(x,y) and IH(x,y) 
are both low pass and high pass filtered in vertical 
direction and down sampled by a factor of two 
to create sub bands (sub images) ILL(x,y) , 
ILH(x,y) , IHL(x,y) , and IHH (x,y) . 

The I LL (x,y) contains the average image 
information corresponding to low frequency 
band of multi scale decomposition. It could be 
considered as smoothed and sub sampled version 
of the source image I(x,y).lt represents the 
approximation of source image I(x,y). I L h 
(x,y) , I HL (x,y) and I HH (x,y), are detailed 
sub images which contain directional 
(horizontal, vertical and diagonal) 

information of the source image I(x,y) , due 
to spatial orientation. Multi-resolution could be 
achieved by recursively applying the same 
algorithm to low pass coefficients from the 
previous decomposition[14]. 



Inverse 2-D wavelet transform is 
used to reconstruct the image I(x,y), from 
sub images I L L(x,y), I LH (x,y) , I HL (x,y) and 
Ihh (x,y), as shown in fig. 5. This involves 
column up sampling (inserting zeros between 
samples) and filtering using low pass L and 
high pass filter H for each sub images. Row up 
sampling and filtering with low pass filter •£ and 
high pass filter //of the resulting image and 
summation of all matrices would construct the 
image I(x,y) 
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Notations: 1( insert 1 rolumn of zeros in between columns (up sampling in columns) 
\^ jj insert 1 row of zeros in between rows (up sapling in rows) 

Fig. 5. Information flow of one-level of 2-D 
reconstruction 

2.3 Image Fusion by Discrete Wavelet 
Transform (D WT) 

The information flow diagram of discrete 
wavelet- based image fusion algorithm is shown in 
Fig. 6 
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Fig. 6. Information flow of discrete wavelet 

based image fusion 

The fused image (// (x,y)) could be 

obtained by taking the inverse discrete 

wavelet transform (IDWT) as: 

T f (x,y)=IDWT 

[${DWT(I l (.x,y)),DWT(l 2 (x,y))}~\ 
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3. FUSION METHODS COMPARISON 

Meaningful comparison of image fusion 
methods is often dependent on the application. For 
some applications (e.g medical image fusion) the 
aim of the fusion is to combine perceptually salient 
image elements such as edges and high contrast 
regions. Evaluation of fusion techniques for such 
applications can only be effectively based on a 
perceptual comparison. Apart from visual 
appearance quantitative analysis is done over the 
fused images . Therefore the comparison is of two 
types. 

1 . Qualitative comparison 

2. Quantitative comparison 
3.7 Qualitative comparison 

For the visual evaluation the following 
criterion is considered:natural appearance , 
brilliance contrast,presence of complementary 
features, enhancement of common features etc. 




M 



tf *% 




Fig .8. (a) First image (MR) of medical test set (b) Second 
Image(CT) of medical test set (c) Fused image using 
Average Pixel values (d) Fused Image using DWT 





Fig .9. (a) First image of Multi focus test set (b) 
Second Image of Multi focus test set (c) Fused image using 
Average Pixel values (d) Fused Image using DWT 



By visual Comparison,Fig .8 shows that 
DWT preserves the subtle regions of the brain in the 
MR image better than the average pixel method. Also 
Fig. 9. shows that DWT gives more clear multi focus 
fused image than average pixel method. 

3.2 Quantitative comparison 

For Quantitative comparison, 

firstly a "ground truth" image has to be created that 
can be quantitatively compared to the fusion result 
images. This is produced using a single cut-and-paste 
technique, physically taking the "in focus" areas 
from each image and combining them. The 
qualitative measure used to compare the cut-and- 
paste image to the fused image was taken from [21]. 



= ^S^ 



, [Mw) -'/<*(*. i)f 



n' 2 

where I g , is the cut-and-paste "ground 
truth" image , Ifd is the fused image and N is the 
size of the image. Lower values of ** indicate 
greater similarity between the images I g , and Ifd and 
therefore more successful fusion in terms of 
quantitatively measurable similarity. 

4. EXPERIMENTAL RESULTS AND 
ANALYSIS 

The qualitative measure /' is calculated for 
Average Pixel, PCA and DWT fusion methods 
which are given in table . 1 



Fusion method 


P 


Average pixel 


8. 3553 


PCA 


8. 3737 


DWT 


7.6551 



Tab 1. Results of Average Pixel, PCA, DWT methods 

Table 1. shows the results of comparison of 
P values of all the three methods. It depicts that the 
value for average pixel value method is higher than 
PCA and DWT. So it gives only a baseline result. The 
PCA method gives a equivalent but still it is also a 
worse result. These methods have poor results when 
compared to DWT method . 
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5.0 CONCLUSION 

This paper presents the various categories, 
stages and techniques for image fusion. For an 
effective fusion of images, a technique should aim to 
retain important features from all input images. These 
features often appear at different positions and 
scales. Simple non-resolution methods of image 
fusion such as averaging and PCA methods produces 
only limited results. DWT method provide better 
results when compared qualitatively and 
quantitatively. So we inference that DWT fusion 
methods provide computationally efficient image 
fusion techniques and is most used among all the 
discussed methods in this paper . 
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Abstract — Gesture recognition has gained a lot of importance 
over the past few years. With the progress in technology, the need 
for faster communication between different devices has taken the 
priority. As has been the method so far, a number of clicks are 
required by the user before the actual communication between 
devices or data transfer from one place to another takes place. A 
better system would entail the user performing simple hand 
gestures towards the device and corresponding actions will take 
place in the system. Thus, with minimal effort from the user, 
efficient interaction can take place with and between systems. 

In this paper we propose an interactive system, which uses 
human gestures to perform an activity such as transfer of data 
between different devices (e.g mobile devices, computers, tablets, 
PC) 

KeyWords — HCI, Gesture Recognition, Mobile devices, Natural 
Actions, Applications of mobile embedded systems. 



I. Introduction 

Portable devices such as mobile phones, laptops, tablet etc. 
have turned into the basic necessities of the day. With the 
constant improvements in these fields, there is a tremendous 
use of mobile devices for various purposes like taking pictures, 
writing documents, showing presentations etc. 

Gesture recognition is an effective tool in language 
generation process [6], Engineering research also strives to 
create a system of device control using gesture recognition. In 
the present paper the authors aim to present a novel method of 
gesture recognition. 

The advancement in the operating system for mobile 
devices allows users to develop vast range of applications. At 
the end of the day there is also a need to use the same data on 
multiple systems or devices as well, for example the pictures 
might need to be edited before they can be shown to the rest of 
the world or there might be a need to add some notes using a 
tablet to be used on the system. 

Devices like Kinect controllers allow some easy interaction 
between the user and the gaming system and enhance the user 
experience. Such a system can also be used to communicate 
with the computer. But such systems suffer from noise 
disturbances where any third person can come between the user 
and the system and thus spoil the experience for the user. 



Also a number of online applications are available wherein 
the user can synchronize his/her files with another system. Yet 
again the total interaction needed for this purpose is not as user 
friendly as it could be. 

Mobile devices are very common and majority of the people 
use smart phones in their daily lives. Their interactions with 
these devices can be made easier by using such gesture 
recognition features. For example, if the user needs to transfer 
a file from his mobile device to the tablet or a computer in 
front of him, he can simply perform the drop action and the 
transfer will be done. To perform a move operation to another 
window, he can move the phone to the left or right depending 
on whichever side he wants to move the window to. 

All the concerned data will be stored over the cloud 
pertaining to the current user and no one else can access this 
data apart from that particular user. This adds to constraints 
where we need the user to be logged in to an application in the 
mobile devices and computer. The data transfer or the actions 
only take place if the user is authenticated to the system. Once 
this is done the user can simply transfer or control the other 
system using his hand or mobile device. 

This authentication before the use of API also helps 
improve security. Thus the inclusion of facial or voice 
recognition along with gesture recognition feature ensures a 
secure access to the services even after prolonged usage of the 
software and the hardware. 

Once the user's face or voice is recognized, an unhindered 
use of the gesture recognition feature is allowed for the next 
hour based on the security needs for the device after which the 
key will be issued again by the cloud. 

II.RELATED WORK 

[10] is a survey paper which speaks about the various gesture 
recognition techniques with an emphasis on hand gestures and 
facial recognition. [12] Talks about gesture recognition using 
Bezier curves from registered 3D data. [2] Presents a very 
novel method of gestures by using quadratic curves to 
recognize the gesture. [15] Talks about dynamic hand gestures 
via the use of Kalman filter based hand tracking and a HMM 
based temporal characterization scheme. [13] is another HMM 
based algorithm where time-sequential images is transformed 
into an image feature vector sequence, and eventually to a 
symbol sequence by vector quantization. [18] Explains a 
seamless way of passing data among multiple users and 
devices. 
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Viola and Jones proposed an algorithm, called Haar 
Classifiers, to rapidly detect any object, including human faces, 
using AdaBoost classifier cascades that are based on Haar-like 
features and not pixels [7]. 



the user activity and then the URL is transferred to the 
application where it can then be used appropriately. 

B. Application on the phone to other devices 



Haar classifier [3] is used to accurately detect facial features. 
Recognizing other gestures like hand movements requires that 
the Haar classifier cascades be trained first. 

Intel developed an open source library devoted to easing 
the implementation of computer vision related programs called 
Open Computer Vision Library (OpenCV). The OpenCV 
library, designed to be used in conjunction with applications 
that pertain to the field of HCI, robotics, biometrics, image 
processing, and other areas where visualization is important 
and includes an implementation of HAAR classifier detection 
and training [4]. 



III.FRAMEWORK 



If there is a file transfer from the phone to another device, the 
user needs to perform simple drop action like moving the 
phone downwards towards the system and the file that needs 
to be transmitted will be transferred to the system. Again the 
receiving system will notify the user once the download is 
finished on the system. For the live actions like switching 
windows or moving in the presentation, an initial pairing of 
the system with special parameters needs to be performed. 
Then the user can move his phone in different directions, 
without even being in the proximity of the system and perform 
all the required actions with the system. If we need to switch 
to the next window screen we move the phone from the 
horizontal position to vertical position. Fig. 1 explains how one 
application is transferring a video on the phone over to the 
user's browsers. 



Libraries are developed so that the developers can easily 
integrate the API calls to the data cloud without breaking any 
previous known functionality of the application. Libraries run 
asynchronously parallel to the application thread, hence the*- 
performance of the application is untouched. We use a simple 
application that communicates using the API, explaining how 
the above procedure happens. 2. 

3. 

Application first needs to authenticate the user by calling the 
Authentication API of the system. Once the authentication key 
is sent to the user, the application now registers all the events 
that gesture or device framework should look out for. 
Whenever the user performs a hand gesture over the camera or 
moves the phone in a particular gesture, event is fired by the 
system and the call back takes over to send the information to 
the cloud. The information here can be anything from a URL, 
or an image, a video or a document that is to be stored against 
a particular user. Any type of data can be stored over in the 
cloud to the user. If the application does not need to store and 
can function only by making use of the event services then it 
can disable the data usage while registering the API use. The 
services are split into two parts, one is the hand gesture 
recognition API's and the other is the device based gestures. 
The latter is for mobile devices that include a gyroscope and 
an accelerometer. Now we take a couple of use cases and see 
how they are combined and used. Authentication by default is 
assumed. 

A. Application on the desktop 

When the user needs to transfer a file from the desktop to 
another mobile device or another computer on the cloud, he 
does the natural action of grabbing the file from the current 
window and then going to the other system and dropping it in 
front of it. The current data on the device window, which may 
be a URL or a file, is transferred to the data cloud. And the 
other side where the drop action is performed is notified about 



IV.THE SYSTEM OVERVIEW 

1 . The application is synchronized to the cloud after the user is 
authenticated. Henceforth it will run in the background of the 
application. 

2. It waits and scans for events to be generated by the user. 

3. When the user performs a gesture, the event is recognized 
and is sent to the data cloud. If there is data present then it is 
sent along with the event to the cloud. 
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Fig 2. Over all Architecture 



4. Data cloud stores history of the activity performed by the user 
along with the data pertaining to the application where the 
activity is performed. 

5. When the user performs another action on the same 
application that the previous action that was performed then 
will be a transfer depending to the particular type of event. 

6. The application downloads the information in the required 
format use it to perform further processing. 

7. After every hour, for security the user face recognition must 
be performed, to get fresh key to the application which then 
can be used again. 



A. Mobile Gesture and Actions 

A mobile device has an accelerometer and gyroscope which is 
what we make use of to find out what gesture the user is 
performing. Accelerometer is a dynamic sensor capable of a 
vast range of sensing. Accelerometers that can measure 
acceleration in one, two, or three orthogonal axes are 
available. They are typically used in one of the following three 
modes: 

■ As an inertial measurement of velocity and position; 

■ As a sensor of inclination, tilt, or orientation in 2 or 3 
dimensions, as referenced from the acceleration of 
gravity (1 g = 9.8m/s 2 ); 

■ As a vibration or impact (shock) sensor 



external torque, its orientation remains nearly fixed, regardless 
of any motion of the platform on which it is mounted. 

From the above we derive the Rotational vector sensor that we 
can use in the device. The rotation vector represents the 
orientation of the device as a combination of an angle and an 
axis, in which the device has been rotated through an angle 9 
around an axis (x, y, or z). The following code shows you how 
to get an instance of the default rotation vector sensor. The 
three elements of the rotation vector are expressed as follows: 

x*sin(8/2) y*sin(8/2) z*sin(8/2) 

Where the magnitude of the rotation vector is equal to 
sin(9/2), and the direction of the rotation vector is equal to the 
direction of the axis of rotation. The three elements of the 
rotation vector are equal to the last three components of a unit 
quaternion (cos(9/2), x*sin(9/2), y*sin(9/2), z*sin(9/2)). 
Elements of the rotation vector are unit less. The x, y, and z 
axes are defined in the same way as the acceleration sensor. 
The reference coordinate system is defined as a direct 
orthonormal basis (see figure 3). This coordinate system has 
the following characteristics: 

X is defined as the vector product Y x Z. It is tangential to 
the ground at the device's current location and points 
approximately east. 

Y is tangential to the ground at the device's current location 
and points toward the geomagnetic North Pole.Z points toward 
the sky and is perpendicular to the ground plane. 

Now since we have the rotational values of the phone we can 
use that to find out what kind of moment the phone has 
moved. This is similar to tilt of the phone. But since the 
movement can be highly rotational in nature we use the above 
one. To improve the kind of actions we can use with the user 
we also use the accelerometer reading to figure out what the 
user is going to do. The values x , y , z, p , q , r that we derive 
from the above, where x, y, z are the acceleration values , p, q, 
r are the rotational vector values , compare with the data that 
we have predefined in the system. When the values have the 
distance less than the threshold to decide what action has 
occurred then we decide what event to rise. The Server is then 
notified about the action that is performed. If there is a drop 
action then the files are uploaded to the server. The server 
sends a notification to the mobile once the action has been 
completed on the device. 



A gyroscope is a device for measuring or 
maintaining orientation, based on the principles of angular 
momentum. In essence, a mechanical gyroscope is a spinning 
wheel or disk whose axle is free to take any orientation. 
Although this orientation does not remain fixed, it changes in 
response to an external torque much less and in a different 
direction than it would without the large angular momentum 
associated with the disk's high rate of spin and moment of 
inertia. Since mounting the device in gimbals minimizes 




Fig 3 
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B. The other client 

If the phone is the device that was used to instantiate the 
communication then the other devices receiving the actions 
are possibly another mobile device and a computer. For the 
system to know where the particular data must be transferred 
from the previous screen so that a 1-1 communication is done, 
it must be in ready state, always checking the environment for 
any sort of events that can occur. One thing that the system 
does is template matching to look out for any phone action 
that was performed by the user in front of the particular 
system. 

Template matching is a technique in digital image 
processing for finding small parts of an image, which match a 
template image. For templates without strong features, or 
when the bulk of the template image constitutes the matching 
image, a template-based approach may be effective. Since 
template-based template matching may potentially require 
sampling of a large number of points, it is possible to reduce 
the number of sampling points by reducing the resolution of 
the search and template images by the same factor and 
performing the operation on the resultant downsized images 
(multi-resolution, or pyramid, image processing), providing a 
search window of data points within the search image so that 
the template does not have to search every viable data point, 
or a combination of both. The next feature that it tracks is the 
hands using the fast HAAR classifier to find the hand in the 
frames of the image, which then follows the use of cam-shift 
to track the various hand positions. Now if an object like 
another phone or if a hand is found in the screen then we flag 
them and we follow their actions. If the server replies back 
with the positive saying a communication was done with the 
system then we can do the data transfer to the system. 



D. Efficiency 

The system on the whole works as a background process and 
only communicates when there is a particular event that is 
triggered by the the system. All the communication that occurs 
in the system happens with minimal data transfer between the 
systems hence the bandwidth of the user is not affected. No 
additional need of hardware resources is needed. It can be 
implemented across various platforms which have a camera 
and accelerometer installed in them. 



V.FUTURE ENHANCEMENTS 

• Security: 

To verify that the hand gestures are performed by an 
authorized person, the face recognition of the user feature is to 
be added. The user is required to verify him/her before using 
the gesture recognition feature. Once the face is verified, the 
gesture can be performed. We can also assure secured gesture 
recognized data transfer by verifying the thumb impression. 
This verification process can be repeated every hour. 

• The file selected by the gesture will be uploaded and will be 
waiting in the queue. If there is no sequential download action 
to other devices, then the file selection might not be the 
intended action. Hence, for the next action for the file transfer 
still have the same earlier file which will be present in the 
queue. Therefore we can consider the threshold time and 
beyond which the file from the queue has to be deleted. In 
another scenario wherein there are two sequential select 
actions, then the first file uploaded should be deleted. 
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Abstract: In this paper we give an 
innovative approach for extracting the 
crosspoints for various parameters for 
the same image using the Gabor filter. 
The different parameters used in Gabor 
filter are frequency, wavelength, x and y 
coordinates, phase angle, aspect ratio. 
Using this method we get different sets 
of crosspoints for the same image. So 
better approximate results can be 
gathered from this method as better 
clarities can be obtained in terms of 
classification and matching. Forensic 
experts get lots of problems when the 
error between a set of distances are 
within some specific decimal values. So 
if different intensities can be targeted 
using different frequencies, we get an 
alternative solution to an already 
existing one when the error is very 
small. This method also helps the 
medical physicians as it gives a 
confirmation whether the person has a 
disease or no. So we modify the different 
Shape Extractor algorithms by bringing 
in frequency as a parameter we get 
better confirmative results. 



1. INTRODUCTION 

Gabor Filters are cosine transform based 
filters which are harmonic in nature. 
These filters help to target all feature 
points in an image as intensities and 
frequencies are proportional to each other. 
In this paper we apply the Gabor Filter by 
first finding the texture of the image. Here 
the image is the retina. Fig 1 shows the 
sample image. 




Fig 1: Sample image. 



Keywords: Biometric, Gabor Filter, 
crosspoints. 



One such application on a retina is shown 
below. 
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Fig 2: Texture of an image 

The texture extracts the region where the 
arteries and veins are located. We can 
clearly see it from the figure. The textures 
for retinas in different directions (left or 
right) are shown below. 




Fig 3: Texture of left eye of fundus image 




The different textures are obtained by 
applying different thresholds on the 
intensities of the images [4] [8]. 

Gabor Filter 

A Gabor Filter is made up of an 
exponential term multiplied by cosine term 
which is a harmonic function. A Gabor 
Filter function is dependent on the 
dimension of the image, aspect ratio, 
wavelength and the phase angle. One 
example of Gabor matrix is given by: 
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Fig 4: Textures of right eye of fundus 
images. 



Fig 5: A well generated Gabor Matrix. 

The above figure shows a 7 by 7 matrix 
(Aspect Ratio=l). 

2. METHODOLOGY 

2.1 Gabor Filter Methodology of 
extracting the cross points: 

Detection of Bifurcation Points by 
Extraction of Shapes (Gabor Filter). 
After reading the image, it is converted 
into gray scale and then the maximum 
pixel intensity minus the minimum pixel 
intensity is found out for a pixel in a 
neighborhood. Then we use the line- 
structuring element for dilating the image. 
Finally convolve the image with a 3x3 
matrix to find the bifurcation points of 
shapes 'Y'[3][4] and mark these points. 
From all these methods we gather some 
information about some extra features 



Dept ofCSE, PESIT, Bangalore South Campus 



70 



Proceedings of RETCOMP - 2013, Volume 1 



which look to be outside the scope of 
bifurcation point's extraction. So we 
thought of going into Disease recognition 
(Diabetes) because these features mainly 
correspond to diabetic parameters. 11_ 27 
Then we thought of calculating the color 
parameters of these features in terms red, 
green, blue. From the literature we 
gathered, we could analyze that green 
component had a lot of role to play to 
extract the vessels, exudates and the 
optical ball where as the red component 
was significant to extract the micro 
aneurysms[4] and haemorrhages [4]. These 
features are a must to be extracted because 
unlike the fingerprints, retina has always a 
chance of getting diseased over a period of 
time. So Biometric systems based on retina 
also gives some extra information with 
respect to the behavior of a person in terms 
of disease. 

The steps for Gabor filter are as follows: 

(a) Read the image. 

(b) Convert the image to black and white. 

(c) Find Maximum-Minimum for each 
pixel. 

(d) Find edge of the image using Sobel 
operator. 

(e) Using structuring element as line, dilate 
the image. 

(f) Convolve the image with a three by 
three matrix which is of shape 'Y.' 

(g) Mark those points. 

The Gabor Filter takes the form 

Step 1 : Convolve the Gabor filter as 
shown below with the image texture. 



exp 



^+ 7 y' 



X 



2<7 S 



cos 2tt— + ip 



where 



x — x cos 9 + y sin 9 



and 



y = — x sin 9 + ycos 9 

x, y are co-ordinates X is wave length, 6 is 
the phase angle, v|/ is the phase offset, and 



y is the spatial aspect ratio, and specifies 
the ellipticity of the support of the Gabor 
function[2][6]. Here x and y takes different 
values generated from the Grid. 

Step 2: Store the Gabor matrix. 

The equations supporting Gabor transform 
are given by: 

for x = - fix(Sx) : fix(Sx) 

for y = - fix(Sy) : fix(Sy) 

G (fix (Sx) + x + 1 , fix(Sy) + y 
+1) = 

(1/ (2*pi*Sx*Sy)) * exp(-.5*((x/Sx) A 2 + 
(y/Sy) A 2)+2*pi*i*(U*x + V*y)) ; 

end 
end 

where Sx, Sy[6] means the variances in the 
x and y directions and U and V are centre 
frequencies respectively. The results are 
shown in Figure 6(a,b.c.d) from top to 
bottom. 




Fig 6.c 
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Fig 6.d 
Fig 6: The partial segmented image and 
the bifurcation points as shown in the 
fourth image 

The method applied to sample retinal 
images is as follows: 

Step 1: First the texture is extracted from 
the retina which gives the shape of the 
arteries and veins 





Fig 7. a 



Fig 7.b 



ULJ 

Fig 7.c Fig 7.d 

Fig 7: Texture of retinal images 




m 

Fig 8.b 

Fig 8: Some fundus images with set of 

crossover points. 

The above images show the textures and in 
addition to step 1, we find the required 
crosspoints by applying Gabor Filter. 

To extract the cross points, the Gabor 
Matrix is convolved with the texture of 
different sets of retina. 

Now applying this matrix for different sets 
of frequencies we get, 



f=255. 




Fig 9. a 



Fig 8. a 
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Fig 9.b 
Fig 9: (a)Texture of image (b)crosspoints 

of image for f=255 
f=90 



Fig 11. a 





Fig lO.a 




Fig 11. b 

Fig 11: (a)Texture of image (b)crosspoints 
of image for f=127 



f=190. 




Fig lO.b 

Fig 10: (a)Texture of image (b)crosspoints 
of image for f=90 



f=127 



Fig 12.a 




Fig 12.b 
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Fig 12: (a)Texture of image (b)crosspoints 
of image for f=90 



Step 2: To extract the cross points the 
Shape Vector is convolved with the image. 
Here we look for 'Y' shape or 'X' shape. 



3. RESULT AND ANALYSIS 

In this paper we have developed a novel 
approach for extracting Cross points for 
different sets of frequencies which gives a 
better confirmation of results of a single 
retina of one person. The logical 
conclusion of this paper is that 

a) The method works fine for dull 
images 

b) The only disadvantage of this method 
is that we get false points also 



4. CONCLUSION 

One can use gabor filter method can be 
used to extract the features of retina. From 
the above images we conclude higher the 
threshold, lesser intensity areas are 
grabbed and lesser the threshold, higher 
threshold areas are grabbed. We also see 
that different sets of feature points are 
gathered for different sets of frequencies. 



5. FUTURE WORK 

In future, work can be done in the area of 
number of features for diabetes and also 
thought can be given to find the diameter 
of the vessels or the optical ball or the 
fovea. The major problem for extracting 
the optical ball and the fovea is that the 
region of interest for the optical ball falls 
with the vessels and that for the fovea is 
that in the gray scale image the intensity 
values becomes equal to the intensity 
values for pixels of the vessels. 
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ABSTRACT 

Image fusion is the process of combining images taken from 
different sources, to obtain better situational awareness. The 
objective is to combine the most relevant information from 
source images into composite image. In the output image there 
should not be any introduction of artifacts or inconsistencies 
which will distract human observers or the further processing. 
There are many image fusion techniques based on signal, pixel, 
feature and symbol level fusion. The usefulness of the method 
has been done in various image pairs like multi-focus images, 
multi-sensor satellite image and medical images of cross-section 
of human brain. This paper analyses the various image fusion 
algorithms like the simple image fusion attempts, the pyramid 
decomposition image fusion, and the basic wavelet transform 
based image fusion like discrete wavelet transform and Haar 
wavelet transform. The Pareto optimization method proves that 
the quality produced by the Haar wavelet is better than other 
algorithms. Haar wavelet thus produces better results by the 
proposed gradient and smoothness criterion. 

Keywords 

Discrete Wavelet Transform, Haar Wavelet, Image fusion,Pareto 
optimization,Pyramid method. 

1. INTRODUCTION 

Information makes sense if it is able to convey its contents. 
Information can be recorded as signs, or transmitted as 
signals. The quality of the information is very important. 
Image fusion is the technique used to improve the quality of 
the information from the set of images. 

1.1 Image fusion Definition 

Image fusion is the process of combining information from 
two or more images of a scene into a single composite image. 
It should be more informative and is more suitable for visual 
perception or computer processing. Data fusion is a process 
of dealing with data and information from multiple sources to 
achieve refined/improved information for decision making. In 
data fusion the information of a specific scene acquired by 
two or more sensors at the same time or separate times is 
combined to generate an interpretation of the scene not 
obtainable from a single sensor. 

Image fusion, a component of data fusion is an effective way 
for optimum utilization of large volumes of image from 
multiple sources. Multiple image fusion seeks to combine 
information from multiple sources to achieve inferences that 
are not feasible from a single sensor or source. It is the aim of 
image fusion to integrate different data in order to obtain 



more information than can be derived from each of the single 
sensor data alone [1]. 

1.2 Why Image fusion? 

The fusion of images is often required for images acquired 
from different instrument modalities or for the same scene 
captured in different ways (like multi-sensor, multi-focus and 
multi-modal images). Multifocus image fusion is the process 
of combining information of two or more images of a same 
scene to result in all scene areas are in focus. When one scene 
contains objects in different distance, the camera can be 
focused on each object one after the other, creating set of 
pictures. Then, using image fusion technique, an image with 
better focus across all area can be generated. 

Medical image fusion has become a common term used in 
medical diagnostics and treatment. The term is used when 
multiple patient images are registered and merged to provide 
additional information. Fused images may be created from 
multiple images from the same imaging modality, or by 
combining information from multiple modalities, such as 
magnetic resonance image (MRI), computed tomography 
(CT). In radiology and radiation oncology, these images serve 
different purposes. Advantages of Image Fusion are clear 
with the following Fig. 1 where the image fusion mechanism 
improves the reliability by the redundant information and 
also improves the capability by complementary information. 

Redundant Information 




Complementary Information 



Fig. 1 Advantages of image fusion. 



The 



2. GENERIC REQUIREMENTS 

perception of the fused image is of paramount 



importance. When evaluating different fusion schemes some 
generic requirements can be imposed on the fusion result. 
The fused image should preserve as closely as possible all 
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relevant information contained in the input images [1]. The 
fusion process should not introduce any artifacts which can 
distract or mislead the human observer. In the fused image 
irrelevant features and noise should be suppressed to a 
maximum extent. 

The first step toward fusion, which may be interpreted as a 
preprocessing step is the registration which brings down the 
constituting images to a common coordinate system. This 
step is important as fusion of images is meaningful only 
when common objects in images have identical geometric 
configuration with respect to size, location and orientation in 
all the images [2], In the next step, the images are combined 
to form a single fused image through a judicious selection of 
proportions of different features from different images. 

3. IMAGE FUSION ALGORITHMS 

The most important issue concerning image fusion is to 
determine how to combine the sensor images. In recent years, 
several image fusion techniques have been proposed. The 
image fusion algorithm from the research is put into three 
categories as given in the Fig 2. In this section, these three 
categories are discussed with their limitations and their 
performances are compared based on simple metrics. The 
pair of medical images considered for evaluation of image 
fusion are CT scan image and MRI scan image. 

Simple image fusion attempts 



I 



Pyramid-decomposition-based image fusion 



♦ 



Wavelet-transform-based image fusion 

Fig. 2 Evaluation of Image Fusion. 

3.1 Simple fusion algorithms 

The simple fusion methods perform the fusion on the source 
images. This method uses the basic operations like addition, 
subtraction of the pixel intensities of the input images to be 
fused. 

The simple fusion methods are: 

■ Averaging Method 

■ Select Maximum/Select Minimum 

3.1.1 Averaging Method 

This method developed by Burt and Kolczynski [3] uses a 
normalized correlation between the two images. The resultant 
coefficient for reconstruction is calculated from weighted 
average of the two images coefficients that means averaging 
of every corresponding pixel of input image to get output 
image. Fig.3 shows the multi spectral images, which are used 
as input images for the fusion algorithms. 




Fig. 3 Pair of input images 

3.1.2 Select Maximum/Minimum Methods 

This simple scheme just picks the coefficient in each sub- 
band with the largest magnitude, that is, for every 
corresponding pixel in the input images the maximum or 
minimum intensity is selected [4]. Fig. 4 shows the output 
images which is fused based on the average and maximum 
methods. 




Fig. 4 Fused images by Average method and Maximum method 



The output images in Fig. 4 depicts that simple fusion method 
produces unsatisfactory results. When this direct method is 
applied, the contrast of feature uniquely presented in either of 
the images is reduced. This method is good only when the 
input images have a high brightness and high contrast. To 
solve this problem, several Pyramid decomposition based 
image fusion is proposed. 

3.2 Pyramid decomposition fusion method 

An image pyramid can be described as a collection of low or 
band pass copies of an original image in which both band 
limit and sample density are reduced in regular steps. An 
image pyramid is constructed for each of the input image, and 
then pyramid is formed for the composite image by selecting 
coefficients from the input image pyramids. Finally, the fused 
image is obtained by taking inverse pyramid transform. 

Several types of pyramid decomposition based methods 
developed for image fusion are: 

• Laplacian Pyramid 

• Ratio-of-low-pass Pyramid 

• Gradient Pyramid 

• Filter-Subtract-decimate (FSD) 

• Morphological Pyramid 

Pair of medical images given in Fig. 5 is considered as input 
for all types of pyramid decomposition method which is 
mentioned above. 
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Fig. 5 Pair of input images 



3.2.1 Laplacian Pyramid 

Burt and Kolczynski [3] introduced the Laplacian algorithm. 
The lowest level of the pyramid has the same scale as the 
original image and contains the highest resolution 
information. A strength measure (sum area, max, min) is used 
to decide from which source what pixels contribute at each 
specific sample location. Higher levels of the pyramid are 
reduced resolution and increased scale versions of the 
original image. 




Fig. 6 Fused image by Laplacian pyramid method 



The output image in Fig. 6 based on Laplacian pyramid based 
fusion has the following drawback. Often the fused images 
contain the blocking artifacts in region where it is going to 
fuse. 

3.2.2 Ratio-of-low-pass Pyramid 

Ratio of Low Pass Pyramid [7] is another pyramid in which 
at every level the image is the ratio of two successive levels 
of the Gaussian pyramid. In this case the fusion rule is to 
select at each pixel location i, j at the pyramid level L the 
pixel value from the largest deviation from unity from source 
images. 




Fig. 7 Fused image by Ratio pyramid method 



The output image in Fig. 7 based on ratio-of low-pass method 
has a major drawback where it does not contribute to the 
sharpness of an image. 



3.2.3 Gradient pyramid 

Gradient pyramid of an image is obtained by applying 
gradient operators to every level of its Gaussian pyramid G. 
The gradient operators are used in the horizontal, vertical, 
and 2 diagonal directions. 




Fig.8 Fused image by Gradient method 



Fig. 8 shows the fused output image which is obtained based 
on Gradient pyramid algorithm. This method considers low 
pass filters for the decomposition so, results in low quality. 

3.2.4 Filter- Subtract-decimate (FSD) 

This is similar to Laplacian fusion, the difference being in 
using FSD pyramid instead of Laplacian Pyramids. FSD 
Pyramid technique is a more computationally efficient 
variation of the Gaussian Pyramid. 




Fig.9 Fused image by FSD method 



Fig. 9 shows the fused image which is obtained by the FSD 
algorithm. This method decimates the input image matrices 
by neglecting every alternate row and column. 

3.2.5 Morphological Pyramid 

This method typically uses low or band pass fdters as part of 
the process. The filtering operations usually alter the details 
of shape and the exact location of the objects in the image. 
This problem has been addressed by using morphological 
filters to remove the image details without adverse effects. 
Morphological filters are composed of a number of 
elementary transformations: closing and opening 
transformations [10]. 

The opening operator can be expressed as a composition of 
two other operators: erosion followed by dilation, both by the 
same input structural element. The main mechanism under 
the erosion operator is the local comparison of a shape, called 
structural element. Structuring element is a matrix used to 
define a neighborhood shape and size for morphological 
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operations, including dilation and erosion. It consists of only 
0's and l's and can have an arbitrary shape and size. 




Fig. 10 Fused image by Morphological Pyramid method 



The Fig. 10 shows the output image which is obtained based 
on the Morphological pyramid. These algorithms differ in the 
way the decomposition and recomposition take place. A 
better fusion results has been achieved by wavelet transform. 

3.3 Wavelet transform based image fusion 

Transform of a signal is just another form of representing the 
signal. Mathematical transform are applied to signal to obtain 
further information which is not present in raw signal. 
Fourier Transform of a time domain signal gives frequency 
domain representation. Limitation of Fourier transform are 
when it is in time domain it will not give information 
regarding frequency and when it is in frequency domain it 
will not provide information regarding time. The ultimate 
solution is the wavelet transform which provides time 
frequency representation simultaneously. It provides variable 
resolution such as at high frequency wavelet transform gives 
good time resolution and poor frequency resolution and at 
low frequency wavelet transform gives good frequency 
resolution and poor time resolution. Discrete Wavelet family 
includes Haar Wavelet, Daubechies Wavelet are discussed 
here. 

3.3.1 Daubechies wavelets 

The Daubechies wavelets are a family of orthogonal wavelets 
defining a discrete wavelet transform. It is characterized by a 
maximal number of vanishing moments for some given 
support. With each wavelet type of this class, there is a 
scaling function (also called father wavelet) [6] which 
generates an orthogonal multiresolution analysis. A pair of 
medical images given in the Fig. 1 1 is considered as input for 
the fusion by the Daubechies wavelet based algorithms. 





Fig. 12 Fused image by Daubechies wavelet 

Fig. 12 shows the output image which is obtained by the 
Daubechies wavelets method. In general the Daubechies 
wavelets are chosen to have the highest number of vanishing 
moments so this does not imply the best smoothness. 

3.3.2 Discrete wavelet transforms 

A common wavelet analysis technique used for fusion is the 
discrete wavelet transform (DWT) [6] [8] [9]. In 2D image, a 
single level decomposition can be performed resulting in four 
different frequency bands namely LL, LH, HL and HH sub 
band and an N level decomposition can be performed 
resulting in 3N+1 different frequency bands as it is shown in 
Fig 13. At each level of decomposition, the image is split into 
high frequency and low frequency components. The low 
frequency components can be further decomposed until the 
desired resolution is reached. 

In practice when wavelet decomposition is used for image 
fusion, one level of decomposition can be sufficient, but this 
depends on the ratio of the spatial resolutions of the images 
being fused. The conventional DWT can be applied using 
either a decimated or an un-decimated algorithm. In the 
decimated algorithm, the signal is down sampled after each 
level of transformation. In the case of a two-dimensional 
image, down-sampling is performed by keeping one out of 
every two rows and columns, making the transformed image 
one quarter of the original size and half the original 
resolution. The decimated algorithm can therefore be 
represented visually as a pyramid, where the spatial 
resolution becomes coarser as the image becomes smaller. 
The decimated algorithm is not shift-invariant, which means 
that it is sensitive to shifts of the input image. The decimation 
process also has a negative impact on the linear continuity of 
spatial features that do not have a horizontal or vertical 
orientation. These two factors tend to introduce artifacts 
when the algorithm is used in applications such as image 
fusion. 



Fig. 11 Pair of input images 



LL- 


HL 2 


HL 1 


LH 2 


HH 2 


LH 1 


HH 1 



Fig.13 The Decomposition levels. 
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3.3.3 Haar wavelet 

Haar wavelet is a sequence of rescaled "square-shaped" 
functions which together form a wavelet family or basis as 
shown in Fig 14. There are a wide variety of popular wavelet 
algorithms, including Daubechies wavelets, have the 
disadvantage of being more expensive to calculate than the 
Haar wavelets. The higher resolution provided by other 
wavelets is not worth the cost for financial time series, which 
are characterized by jagged transitions. 



Fig. 14 Haar wavelet 

The Haar wavelet's function can be described as 



i o < t < y 2 , 

1 '/2<t<l, 

otherwise 



¥(t)=< 

v. 

Its scaling function can be described as 



O(t) 



< t < Vi, 
otherwise 



Using the discrete wavelet transform, any sequence of even 
length can be transformed into a sequence of two -component- 
vectors. Each vector right-multiplies with the matrix, gets the 
result of one stage of the fast Haar-wavelet transform. The 
sequences s and d is usually separated and continues with 
transforming the sequence s. Sequence s is often referred to 
as the averages part, whereas d is known as the details part. If 
a sequence of length is a multiple of four, then blocks of 4 
elements can be built. Transform these blocks in a similar 
manner with the 4x4 Haar matrix which combines two stages 
of the fast Haar-wavelet transform. 



H \ 



The Haar transform is the simplest of other wavelet 
transforms. This transform cross-multiplies a function against 
the Haar wavelet with various shifts and stretches. The Haar 
transform is derived from the Haar matrix. An example of a 
4x4 Haar transformation matrix is shown as 
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Fig. 15 shows the pair of input images used by the Haar 
wavelet algorithm. 




Fig. 15 Pair of input image for Haar wavelet 




Fig. 16 Fused image by Haar wavelet 

Fig. 16 shows the output image which is produced by Haar 
method. The Haar wavelet transform has a number of 
advantages compared to other algorithms. 

• It is conceptually simple. 

• It is fast. 

• It is memory efficient, since it can be calculated in 
place without a temporary array. 

• It is exactly reversible without the edge effects that 
are a problem with other wavelet transforms. 

• The Haar transform is the simplest orthogonal 
wavelet transform. It is computed by iterating 
difference and averaging between odd and even 
samples of the signal. 2-D images needs to compute 
the average and difference in the horizontal and then 
in the vertical direction. 

Shivsubramani, K P Soman [5] used simple metrics for the 
comparison of fusion algorithms. The simple metrics are, 
Mean Squared Error(MSE), Peak Signal to Noise Ratio 
(PSNR), Average difference(AD), Structural content(SC), 
Normalized Cross Correlation(NCC), Maximum 
Difference(MD). These metrics used to assess the best fusion 
algorithms in terms of quality using Pareto optimality 
method, which is a commonly used tool in multi optimization 
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when the different metrics cannot be compared. Table 1.1 
gives the information about the image quality metrics which 
is used by all the image fusion algorithms to validate the 
results based on the image metrics readings. 

Table 1.1 Image metrics for a pair of medical images 



Algorithm 


MSE 


PSNR 


AD 


SC 


NCC 


MD 


Average 


177.28 


25.62 


-9.57 


0.85 


1.08 


82 


Maximum 


116.5 


27.47 


-1.22 


0.98 


1.01 


22 


FSD 


157.12 


26.17 


-10.4 


0.09 


1.08 


96 


Laplacian 
pyramid 


305.43 


23.28 


-16.47 


0.08 


1.1 


92 


Gradient 
Pyramid 


146.26 


26.48 


-9.57 


0.86 


1.07 


74 


Ratio-of- 
low-pass 
Pyramid 


166.11 


25.93 


-0.58 


1 


1 


56 


Morpholo- 
gical 
pyramid 


1855.1 


15.45 


-39.29 


0.64 


1.23 


104 


Daubechies 
wavelet 


351.2 


22.58 


-17.07 


0.82 


1.1 


66 


Haar 
wavelet 


244.79 


24.24 


-14.39 


0.83 


1.09 


84 



The results are verified based on the image quality metrics. 
Ten respondents were selected in random and asked to select 
image they visually understood to be the best quality. It was 
found that out of the 30 assessments, The Haar based fusion 
is 63.33% best. The assessment saw that the fused images 
produced by Morphological pyramid method were rated most 
low in quality, so it becomes dominated algorithm. Haar 
wavelet is used for ease of implementation. It produces the 
output without the loss of generality and to work on any 
specific types of images. In Haar wavelet the fusion can be 
done based on proposed gradient and smoothness criterion 
ensures that the area of uniform intensity is incorporated in 
the fused images thus the effect of noise is minimized. 

4. CONCLUSION 

This paper analyses the various image fusion algorithms. The 
most important issue concerning image fusion is to determine 
how to combine the sensor images. In recent years, several 
image fusion techniques have been proposed. The simple 
fusion schemes perform the fusion right on the source 
images. One of the simplest of these image fusion methods 
just takes the pixel-by-pixel gray level average of the source 
images. This simplistic approach often has serious side 
effects such as reducing the contrast. With the introduction of 
pyramid transform some more sophisticated approaches 
began to emerge. It was found that better results were 
obtained if the fusion was performed in the transform 
domain. Several types of pyramid decomposition or multi- 
scale transform are developed for image fusion but the way 
the decomposition and recomposition phases differ in each of 



pyramid algorithms. The multi-scale wavelet decomposition 
has begun to take the place of pyramid decomposition for 
image fusion. It retains most of the advantages for image 
fusion but has much more complete theoretical support. The 
Haar wavelet which performs the fusion under the proposed 
gradient and smoothness criterion achieves the best results 
than other algorithms. Because of its efficient performance 
Haar wavelet can be considered to be the best method to be 
used for the fusion of CT and MRI images. The level of 
information obtained in the fused images using Haar wavelet 
transform can be highly beneficial in medical field. 
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Abstract -As we are in the development 
phase of our own super computer, we have 
identified several applications which are 
highly computationally intensive 

applications for a normal desktop computer 
to achieve the solution. These identified 
applications are related to multidisciplinary 
like bio-medical, mathematics, fluid- 
dynamics, genetic algorithms. We are 
actually identifying the parallel 
computations involved in each application 
which are independent to each other and 
optimizing it to perform better in our 
design of supercomputer. We basically use 
clustered node approach in which one 
master node distributes the jobs to multiple 
client nodes after identifying parallel 
computations involved in the application 
which are independent to each other. The 
application uses massively computationally 
intensive huge data which are SIMD based. 
We have around 20 nodes with 2 GPUs per 
node to optimize the computation. We have 
developed a cluster with job scheduler code 
to distribute the task to multiple clients 
after identifying parallel computations 
which are independent. This paper gives us 



a great satisfaction in studying the concept 
of high-performance in various fields of 
computationally intensive applications. The 
main application we focussed on is retinal 
pattern matching using shape extractors 
using the fast computing method of 
extracting bifurcation points. 



KEYWORDS: SIMD, Biometrics, Parallel 
computing, High Performance 
Computing. 



1. INTRODUCTION 

In the field of Forensic Science it has been 
very difficult to get a Biometric which is 
unique for any person in the sense that it 
doesn't change for the same person under any 
circumstances. Many of the Forensic scientists 
are concentrating on an area where they are 
sure of good results in terms of robustness, 
uniqueness, measure. There were many trials 
made on the concept of correlation between 
different biometrics. But since the human 
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brain has different set of neurons controlling 
the sensory organs, it is but very difficult to 
correlate the different biometrics [1]. So 
Forensic Science and Ophthalmology has 
made an attempt of finding a Biometric which 
finds unique features which are unique for a 
person. Cross points on the retina help to 
identify diseases and to get other vital 
information which relates to thermal imaging 
where in we see the blood flow in the arteries 
and veins of the retina [1][2]. In case of 
thermal imaging thresholding is applied to find 
exactly the mental state of the person at a 
particular instance of time. Diagrammatically 
anatomy and physiology of the retina looks 
like as shown in the figure. 
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Fig 1 : The above figure shows the rods and 

cones which lead to the formation of 

Bifurcation points. 



Posterior cli-amber 




This Biometric which is Retina which is 
located on the back side of the eye has unique 
points which are Bifurcation points or 
Crossover points. From the above diagrams, 
the medical conclusion is that rods and cones 
lead to the formation of cross points which are 
otherwise called Bifurcation points. 



2. IMAGE PREPROCESSING 

In this paper we concentrate on the boundaries 
of Retina using the Sobel filter. The normal 
retinal image( 1 .jpg) sample is shown below. 
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Different instances of edges of l.jpg are as 
follows. 




• •|..*UU/I».|X . h m 9, 



J^,^ **»*><»? 



a ■■4./ 






£v*^ 



9 ■:♦ 3 .','l»»ixim»l» 



Fig 2: Shows the boundaries of retina. 

The boundaries are found since it becomes 
easy to evaluate the cross over points from the 
grey level image, when the image is tracked 
from left to right. 



1 


1 


1 





1 





1 





1 



These shapes are analyzed in a three by three 
neighbourhood [3]. 

In general if we have a matrix which has a 
representation 



1 
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X 





X 


1 





1 



This kind of matrix helps to approximate the 
shapes of the objects in search to the above 
two shapes. 

The above matrices helps in extracting the 
shapes of entities where in we can spot the 
crossover points. 

Proceeding further with the above algorithm 
we have, 

Step 1: Extract 'Y' or 'H' shape from the 
above retina. 

So we have the following possibilities. 



3. BIFURCATION POINT EXTRACTION 

The crosses over points on the retina 
are evaluated by moving a three by three 
matrix over the retina by applying the 
principle of convolution. 



The matrix looks like: 



1 








1 
1 
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In a retina shapes can take the following 
possibilities (along with one mentioned 
above): 
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4. RESULTS AND ANALYSIS 
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Fig 3: Shows the different shapes for 
Cross point extraction. 
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Here the blocks are marked by 00, 01, 02, 10, 
11, 12,20,21,22. 



Step 2 : Once the shapes of the 

arteries and veins have been 
recognized in the above two 
categories, we mark the 
central pixel in the matrix 
which is 1 1 . 

The shapes belong to following categories. 



Step 3 



The distances between these 
points is calculated by using 
the Euclidean Distance 
formula [6]. 



d(x,y) = Z A/( x i- x j) 2+ (yi-yj) 2 



Fig 4: Shows the list of images (retina's) 
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The crosspoints for an image 1 jpg are 
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Ram hospital lack clarity due to the diverse set 
of patients this hospital gets. 

By performing a comparative study with the 
paper by vb, etal, we come to a logical 
conclusion that our work is an innovative 
extension of this work since the crossover 
points are caught on real time images. 

One such result from the above paper is shown 
below: 







<& & I :-: * o | / s s> *t | x v bi H ' u 



Fig 5: Shows the extraction of Cross points 
using Shape Extractor. 



The above image shows the Bifurcation points 
or the cross points on the retina(l. jpg). 

The real time images taken from Sir Ganga 



In general we have the following 
outputs. 
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The distances stored for all these images 
between the crossover points are as follows: 

12345. 9876 

16756. 8876 

14567. 9876 

13456. 87654 

These distances are unique for any person. 

Therefore comparing our results with the 
literature survey's we come to a conclusion 
that 

a) Algorithm for Shape Extraction works 
very well for a large database for 
patients of various diversities, which is 
faster than sequential computing. 

b) The distances between all the crossover 
points is also unique for any individual, 
which has helped the parallel computing. 

c) Algorithm gives good results when the 
images are not clear in terms of 
matching. The only disadvantage of the 
algorithm is that it generates a set of 
false points, which are undesirable, 
which can be overcome by image 
registration. 



5. FUTURE WORK 

In future, work can be done in the area of 
number of features for diabetes and also 
thought can be given to find the diameter of 
the vessels or the optical ball or the fovea. The 
major problem for extracting the optical ball 
and the fovea is that the region of interest for 
the optical ball falls with the vessels and that 
for the fovea is that in the gray scale image the 
intensity values becomes equal to the intensity 
values for pixels of the vessels. But using 
color components these parameters can be 
improved and further work can be continued 
using Probabilistic Neural Network and also 
Fuzzy Neural Network. 
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Abstract — Solar energy is the abundant energy available 
everywhere at no cost. India is blessed with the tropical climate 
with a good percentage of solar energy. Solar water heater is one 
of the household appliances used in most of the Indian houses. A 
system is proposed in this paper which studies the effect of tilt 
angle of collectors on the efficiency of water heater. The objective 
of the proposed work are to calculate the solar noon based on the 
location, and from the computation of solar noon the tilt angle of 
the collector is calculated to achieve maximum efficiency. The 
proposed technique was subjected to several type of solar water 
heater and the system produced desired outputs 

Keywords- Solar heater, Solar noon, LabVIEW 



I. 



Introduction 



Solar energy is the energy derived from the sun through 
the form of solar radiation. Solar powered electrical generation 
relies on photovoltaic 's and heat engines. A partial list of other 
solar applications includes space heating and cooling through 
solar architecture, day lighting, and solar hot water, solar 
cooking, and high temperature process heat for industrial 
purposes. Solar technologies are broadly characterized as 
either passive solar or active solar depending on the way they 
capture, convert and distribute solar energy. Active solar 
techniques include the use of photovoltaic panels and solar 
thermal collectors to harness the energy. Passive solar 
techniques include orienting a building to the Sun, selecting 
materials with favorable thermal mass or light dispersing 
properties, and designing spaces that naturally circulate air. 
The solar radiation incident on the surface of the earth can be 
conveniently utilized for the benefit of human society. One of 
the popular devices that harness the solar energy is solar hot 
water system (SHWS). A solar water heater consists of a 
collector to collect solar energy and an insulated storage tank 
to store hot water. The solar energy incident on the absorber 
panel coated with selected coating transfers the hat to the riser 
pipes underneath the absorber panel. The water passing 
through the risers get heated up and are delivered the storage 
tank. The re-circulation of the same water through absorber 
panel in the collector raises the temperature to 80°C 
(Maximum) in a good sunny day. The total system with solar 
collector, storage tank and pipelines is called solar hot water 
system. Broadly, the solar water heating systems are of two 
categories. They are: closed loop system and open loop 



system. In the first one, heat exchangers are installed to 
protect the system from hard water obtained from borewells or 
from freezing temperatures in the cold regions. In the other 
type, either thermosyphon or forced circulation system, the 
water in the system is open to the atmosphere at one point or 
other. The thermosyphon systems are simple and relatively 
inexpensive. They are suitable for domestic and small 
institutional systems, provided the water is treated and potable 
in quality. The forced circulation systems employ electrical 
pumps to circulate the water through collectors and storage 
tanks. Based on the collector system, solar water heaters can 
be of two types [1-10]. 

We are blessed with Solar Energy in abundance at no cost. 
The solar radiation incident on the surface of the earth can be 
conveniently utilized for the benefit of human society. One of 
the popular devices that harness the solar energy is solar hot 
water system (SHWS). Electricity is expensive not available 
due to power cuts in many areas when required for heating 
water. Solar Water Heater, since it stores hot water in an 
insulated tank, provides water all the time when required. The 
aim & objective of this work is to provide basic information 
on the components, system efficiency and types of solar water 
heaters currently available and the economic and 
environmental benefits of owning a system. Beside that we are 
trying to provide some computer programs which are essential 
for calculating the efficiency. Also we are trying to focus the 
effect of tilt angel, aperture area and different storage capacity 
on the performance [1-10]. 

In [11], provides a grey-box modeling approach based on 
fuzzy system to predict the outlet water temperature of a solar 
water heating system is discussed. A computer simulation to 
predict the performance of solar domestic hot water systems is 
discussed in [12]. The economical benefits of utilizing solar 
water heating systems for low-income dwellings in Brazil 
have been analyzed in [13]. In [14], a new design of roof- 
integrated water solar collector by integrating the collector 
into roof. Its main concept is based on the use of water 
redistribution for changing the roof configuration. An 
alternative for a large solar thermal electric plants having 
steam Rankine Cycle is discussed in [15]. An experimental & 
numerical study of thermal stratification in a mantle tank has 
been carried out in [16]. 
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The paper is organised as follows: after introduction in 
Section-I, a brief description of block diagram is given in 
Section-II. Section-Ill deals with the problem statement 
followed by proposed solution in Section-IV. Result and 
discussion is given in Section-V. 



II. 



Objectives of proposed work 



Objectives of the proposed work can be stated as 

• To test the collector and find out the efficiency 

• Provide the efficiency certificate to the manufacture 

• Make some computer programming for efficiency 
calculation 

• Discuss the effect of tank capacity, aperture area and tilt 
angle on the efficiency 

III. Experimental setup 



vnm:xl"rk-i 




Figure. 1 Block diagram of proposed instrument 




Figure 2. Picture of actual model 

To find out the efficiency of solar water heating system we 
need a solar water heater whose efficiency is to be found. In 
India there are different manufacture companies present who 
made this type of water heater mainly Anu solar, Kirti solar 
etc. Mainly 100 liters and 200 liters capacity system are 
available in the market for domestic water heating. Evacuated 
tube collector is used in this type of system. The tube length is 
varies from 1500 mm to 1800 mm mainly. The inner diameter 
of the tube generally 47 mm and the outer diameter is 58 mm. 



The total numbers of tubes varies from 10 to 15 for 100 LPD 
(liter per day) systems. For 200 LPD systems it will be 30. 
The solar absorbance of the selective coating is greater than 
0.86 and the solar emittance is less than 0.08. The components 
of solar water heating system are 

(i) Solar collectors(ETC) 

(ii) Insulated storage tank for storing hot water 

(iii) Tank and collector stand along with other instrument 

All physical measurements, such as, length, width, 
thickness, etc. shall be measured and reported in mm. Gross 
area of solar collector is defined as the total area on which 
solar radiation is falling (taking into account the gap, if any, 
between the tubes, and also the frame etc.). For flat plate solar 
collectors, it shall be calculated by multiplying width and 
length of the collector measured from outer edge to the outer 
edge. For the solar water heating models using all-glass ETCs, 
length shall be measured from bottom side of the storage tank 
to the outer edge of bottom tube holder while width shall be 
taken as equal to the length of the bottom tube holder. The 
measurement accuracy of the gross area shall be + 0.1 %. The 
volume of the storage tank shall be measured to an accuracy of 
+ 1%. 

Solar radiation shall be measured using a class-I 
Pyranometer on the plane of solar collector. For this, a 
mounting stand shall be used. 

For measuring ambient air temperature, the measuring 
sensor (a calibrated RTD) shall be located shaded by a 
Stevenson screen in the vicinity of the test set-up (not more 
than 10 m from it. It shall be ensured that there is no 
obstruction by any structure or building to alter (block or 
enhance) the free flow of the natural wind to the sensor. The 
outside surface of the Stevenson screen shall be of light 
colour, preferably white, and its bottom shall be kept at least 1 
m above the ground level. 

The surrounding air speed shall be measured on the 
collector surface at about its middle every half an hour with an 
accuracy of + 0. 1 m/s and the average value of the day will be 
reported along with the test results. 

IV. Problem solution 

A: SOLAR CONSTANT 

The eccentricity of earth orbit is such that the distance 
between the sun and earth varies by 1.7%. The radiation 
emitted by the sun and its spatial relationship to the earth 
result in a nearly fixed intensity of solar radiation outside the 
earth atmospheres which is called as a solar constant (Gsc). Its 
value is 1367 w/m2 according to the World Radiation Center 
(WRC). 

B: SOLAR RADIATION MEASUREMENT 

Solar radiation flux is measured with the help of 
pyranometer, pyrheliometer and sunshine recorder. A 
pyranometer is used to measure solar irradiance on a planar 
surface. A pyranometer"s main components are, A thermopile 
sensor with a black coating. This sensor absorbs all solar 
radiation, has a flat spectrum covering the 300 to 50,000 
nanometer range, and has a near-perfect cosine response. A 
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glass dome. This dome limits the spectral response from 300 
to 2,800 nanometers (cutting off the part above 2,800 nm), 
while preserving the 180 degrees field of view. Another 
function of the dome is that it shields the thermopile sensor 
from convection. The black coating on the thermopile sensor 
absorbs the solar radiation. This radiation is converted to heat. 
The heat flows through the sensor to the pyranometer housing. 
The thermopile sensor generates a voltage output signal that is 
proportional to the solar radiation. 




Figure. 3. Pyranometer 

C: RADIATION ON A SLOPE SURFACE 

The diffuse radiation model is composed of three 
parts. The first is an isotropic part, received uniformly from 
the entire sky dome. The second is circumsolar diffuse, 
resulting from forward scattering of solar radiation and 
concentrated in the part of the sky around the sun. The third, 
referred to as horizon brightening, is concentrated neat the 
horizon and is most pronounced in clear skies. It is necessary 
to know or estimate the solar radiation incident on tilted 
surfaces such as solar collectors, windows, or other passive 
system receivers, The incident solar radiation is the sum of a 
set of radiation streams including beam radiation, the three 
components of diffuse radiation from the sky, and radiation 
reflected from the various surfaces seen by the tilted surface. 
The total incident radiation on this surface can be written as 



IT = ITb + ITdiso + ITdcs + ITdhz + ITrefl 



(1) 



where the subscripts iso, cs, hz, and refl refer to the 
isoiropic, circumsolar, horizon, and reflected radiation 
streams. For a surface (a collector) of area Ac, the total 
incident radiation can be expressed in terms of the beam and 
diffuse radiation on the horizontal surface and the total 
radiation on the surfaces that reflect to the tilted surface. The 
new equation would be 

ACIT = IbRbAC + Id,isoASFs-c + Id,csRbAC + 
Id,hzAhzFhz-c + S IipiAiFi-c (2) 

The first term is the beam contribution. The second is the 
isotropic diffuse term, which includes the product of sky area 
As (an undefined area) and the radiation view factor from the 
sky to the collector Fs-c The third is the circumsolar diffuse, 



which is treated as coming from the same direction as the 
beam. The fourth term is the contribution of the diffuse from 
the horizon from a band with another undefined area Ahz. The 
fifth term is the set of reflected radiation streams from the 
buildings, fields, and so on, to which the tilted surface is 
exposed. The symbol i refer to each of the reflected stream: Ii 
is the solar radiation incident on the ith surface, pi is the 
diffuse reflectance of that surface and Fi-c is the view factor 
from the ith surface to the tilted surface. The equation can be 
rewritten in a useful form by interchanging areas and view 
factors (since the view factor reciprocity relation requires that 
ASFS-C = ACFC-S. This eliminates the undefined areas As 
and Ahz. The area Ac appears in each term in the equation and 
cancels. So 

IT = IbRb + Id,isoFc-s + Id,csRb + Id,hzFc-hz + IpgFc-g (3) 

This equation, with variations, is the basis for methods of 
calculating IT. When IT has been determined the ratio of total 
radiation on the tilted surface to that on the horizontal surface 
can be determined by 



R = 



total radiation on the titled surface 
total radiation on horizontal surface 



(4) 



D: THERMOSYPHON 



Thermosyphon refers to a method of passive heat 
exchange based on natural convection, which circulates liquid 
without the necessity of a mechanical pump. This circulation 
can either be open-loop, as when liquid in a holding tank is 
passed in one direction via a heated transfer tube mounted at 
the bottom of the tank to a distribution point even one 
mounted above the originating tank or it can be a vertical 
closed-loop circuit with return to the original vessel. Its 
intended purpose is to simplify moving liquid and or heat 
transfer, by avoiding the cost and complexity of a 
conventional liquid pump. 




collector 



Figure 4. Thermosyphon 

V. Test sequence 



Sequence 


Test 


1 


Pre-conditioning Test 


2 


Static pressure Leakage Test 


3 


Thermal Performance Test 

(Daytime and nighttime tests shall be performed 

one after another to complete one data set) 
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The following energy balance equation would determine 
the thermal performance of the solar hot water system during 
the day-time: 



Rate of change in 
energy contents of 
the water in the 
storage tan A 



Rate at which useful 
energy is sup plied to 
water in the storage tan k 
bx solar collector 



Rate at which energy 
i is lost from water 

in the storage tan k 
I to the ambient air 



dT. 



<MC) S ^= A a F' [G T (raW - U L (T s -T a )] - (A s U s + A p U p ) (T s -T a ) 

dt 



Rearranging and integrating equation with respect to time, 
between the time period tl to t2 during which the energy is 
collected, one gets 

(MC) s (T s)r T sld ) = A a F jC T (aT) 8N .dt-(A a FU L +A s U s+ ApUp) f(T s -T a )dt 

The total energy incident on the collector during the time 
period from tld to t2d is given by the following expression: 



titf 



Q c = A c JG T dt 

T|sys = flsys.o — Usys.d- X 

Whore 



Hsys - 



(MC) 5 (T dd -T id ) 



"Hs^s.o — 



u 



sys.d - 



A c J G T dt 

A 3 F' |G T (at) a ,.dt 
A c jG T .dt 



A.FUL+A.U.+ApUp 



X = 



ja-T,)* 



J G i dt 



The parameters sys,o and Usys,d are defined as the 
characteristic parameters of the thermosyphonic solar water 
heating systems. 



Night time performance: 

During night time, no solar radiation is received and also 
there is no flow of water through the solar collector. The heat 
is therefore lost primarily from the storage tank and part of the 
piping. In view of this, the equation (3) may be re-written as 
follows: 

JjJt ■ I ■ t-i H 

(MC), f_^-=-(A B U > + Ap'Up) N* 



T -j 



or 



CftCVln 



*■ s.fn ' ™ 

T —T 

1 3,iB 1 M 



"^Bys, n "cttn - tin/ 



WherG 



Usys.nAc^fAsUs + Ap' Up) 



Cooling time constant to rate the performance of the solar 
water heating system during no-radiation period may be 
expressed by the following expression: 



(MC), 



cooling, n 



(U 



A 

sys.n^c 



Day time performance: 

Solar collector is shaded completely with an opaque cover, 
white on the top exterior, at about 4 h before the solar noon. 
The storage tank of the solar water heating system is filled 
fully with water. The quantity of water filled in the storage 
tank only shall be taken as notional capacity of the solar water 
heating system. 

The water in the storage tank is fully mixed by switching 
on pump for 5 minutes before beginning of the test at the 
designated time instant. 

Initial value of the storage water temperature, Tsid is 
measured and recorded, and the shade-cover is removed at 3 h 
30 min before the solar noon. 

Measurements for ambient air temperature, water 
temperature of the storage tank, and solar irradiance on the 
plane of solar collector get commenced at the start of the test. 
The measurements are continued subsequently for the entire 
period of the test at an interval of one minute or less, however, 
recording of data is adequate at an interval of 10 minutes. 

To end the day-test cycle, the solar collector is again 
shaded on expiry of 3 h 30 min from solar noontime. The 
water in the storage tank is again mixed by operating the pump 
for five minutes, and final storage water temperature (Tsfd) is 
measured and recorded. 

The test is repeated for at least ten days with different 
values of initial storage water temperature. The first test shall 
be carried out with cold (supply) water in the storage tank. For 
subsequent days, initial water temperature shall be raised in 
uniform steps to cover up to 70°C temperature (fully mixed 
storage). 
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Different initial temperature in the storage tank may be 
achieved by adding appropriate quantity of cold water to the 
previous day's heated water in the storage tank. Alternatively, 
hot water from any other source may also be used to achieve 
the required temperature of the water in the storage tank. 

The minimum solar irradiance on the plane of solar 
collector, GT, during the period of day-test shall be 550 W/m2 
and the ratio of minimum solar irradiance to maximum solar 
irradiance recorded during the day-test should not be less than 
0.5. The data for the days for which these conditions are not 
specified, will not be used in the estimation of sys, o, and 
Usys, d. 

Based on the measured data obtained during day-test and 
night-test, the characteristic parameters of the solar hot water 
system shall be evaluated as given subsequently. 

Calculate system from the following expression using data 
recorded during the daytime test 



l]sys = 



(MCMT^-Ta) 



|*G T dt 



X = ^- 



[(L-T.KH g|(]m±U_ T 



*<V 



Ira 'd 



C T dt 



C T df 



Where T adm represents average ambient temperature 
during the mth time interval; Tsfin and Tsim represents the 
average water temperature of the two RTD"s in the storage 
tank at the end and start of the mth time interval, respectively. 
Obviously, the value of (t2m -tlm)d in the above expression is 
equivalent to 10 minutes. 

Plot a curve with values of sys on y-axis and those of X on 
x-axis. Draw a trend line using regression method of least 
squares of curve fitting, and obtain values of sys,o and Usys,d 
from the intercept of the trend line with y-axis and slope of the 
trend line respectively. 

System efficiency corresponding to the following standard 
test conditions may be calculated 

Average storage temperature = 50°C Average ambient air 
temperature during test period = 25°C Average of total solar 
radiation incident on the plane of = 700 W/m2 solar collectors 

Amount of thermal energy stored in the storage tank 
during a period of seven hours corresponding to the standard 
test conditions in kWh may be calculated as follows. 

Q = sys (corresponding to standard conditions) x Ac x 0.7 x 7 

In order to calculate cooling time constant, a plot of In 
(Tsm-Tam) versus time shall be plotted using measured data 
from the nighttime test. A trend line using regression method 
of least squares of curve fitting shall be plotted. The slope of 
the trend line shall give value for l/cooling,n Using this value 
in equation (9), the value of Usys,n may be obtained. 



LabVIEW Implementation: 

To achieve the objectives mentioned in the above section 
LabVIEW software is programmed. The LabVIEW program 
contains two parts: front panel and block diagram [20-24]. The 
front panel vi (vi - program written in LabVIEW) is where the 
user can control and/or monitor the designed system. Block 
diagram vi is used to write the program for the desired 
application. 

The front panel of the proposed system consists of a 
numerical indicator to display the efficiency of the specimen 
under test. A stop button is used to halt the process. Three 
numerical indicators to feed the tank size, aperture area, and tilt 
angle. Fig. 5 shows the front panel of the proposed technique. 
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Figure 5. Front panel of proposed technique 

The block diagram for the proposed system consists of a 
DAQ Assist to acquire the signals from sensors like RTD, 

Thermosyphon, and Pyranometer. Once the data is acquired the 

program is written to calculate the parameters like solar noon, 
and various other parameters required to calculate the calculate 
the efficiency of the solar heaters. Once the parameters of the 
desired equation is calculated the variables corresponding is 
calculated using the formula node. The data once used to 
calculate the efficiency of the solar water heaters, the results 
will be displayed using the numerical controls. 

VI. Results and conclusions 





Annu solar 
ltd 
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Natural 
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Efficiency 


53% 


56% 


59% 




Tank capacity 


100LPD 


100LPD 
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Aperture area 


1.18 m2 


1.18m2 


1.56m2 




Tilt angle 


22° 


35° 
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The data shows that the proposed technique has 
successfully achieved its proposed objective, and the 
following conclusions are made. Water temperature can be 
increased 30 to 35 degree with 600 w/m 2 radiation during a 
day. The water temperature decrease 4 to 8 degree during 
night with average ambient temperature of 20 degree at night. 
To get at least 10 days correct data more than 15 days 
experiment needed. Cold water should be pumped to tank 
during no solar condition. Solar noon varies from 11.23 am to 
11.50 am maximum during the test period. Ideal time for test 
is from October to March generally. 
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Abstract- Appropriate delivery of the services to the customer as per 
their requirement becomes a challenge to the cloud service provider 
due to the complex distributed environment. Service Level Agreement 
(SLA) is the only term between customer and service provider that 
governs the conditions to be fulfilled at the time of service delivery. A 
cloud service provider offers the services from different server 
(location) and the same service may be offered by the different 
location with different cost based on the offered quality. The intention 
of the provider will be to select an effective schedule of services taken 
from various locations with minimum cost. In this work, we have 
made an attempt to propose an agent based solution that eases the 
decision making of a cloud service provider in this situation. The 
proposed agents are able to gather these information form different 
locations and render an effective delivery schedule (which service 
from which server). 

Key Words Cloud computing, Service provisioning, Multi-agent 
based techniques, AUML 



I. INTRODUCTION: 



Cloud computing provides a cost-effective, scalable and 
adaptive platform for service provisioning. These factors are 
driving migration of legacy applications into cloud computing 
platform. Cloud offers different level of services as 
Infrastructure-as-a-service, Platform-as-a-service & Software- 
as-a-service. In Cloud every resource usage is on-demand, 
integrated and ready to use combination of services. The 
services come of compute, storage, network, platform and 
application software available as a standardized set offerings 
as a pay-as-you-use pricing model. 

The concept of cloud computing has come to make the 
computing resource virtualized and for optimized usage. 
Cloud computing is the pool of service center. Some of the 
traditional and emerging Cloud-based applications include 
social networking, web hosting, content delivery etc. Each of 
these application types has different distributed and dynamic 
system. The cloud has given the exposure to its users for 
enjoying the open and shared environment. Demand 
prediction and capacity management are crucial issues in 
cloud and hence dynamic resource allocation with the timely 
match of supply and demand [1] is required. 



Cloud resource optimization is one of the biggest challenges 
for the cloud provider. The paper [2] has discussed the idea of 
cloud resource optimization. The author shows that optimizing 
of resource is able to reduce cost substantially. Using an 
objective function the author explained the mechanism needed 
to take place to resource optimization. The demand of 
resource can be two type either it is reserved otherwise on 
demand. If the service provider has the idea of resource 
provisioning in future then the provider can plan accordingly. 
However the challenge lies in on-demand instant resource 
allocation in cloud. Some work is taken place for cost 
minimization on demand service provisioning. The paper [3] 
describes the uncertain nature and problem of resource 
provisioning in cloud. Consumer's future demand is 
unpredictable. Hence it is difficult to predict the future 
demand of a resource at any time instant. The author addresses 
this problem and proposed an algorithm by formulating a 
stochastic programming model. However the approach has 
some implementation issue and complicated in nature. Again 
the stochastic model has its own limitation. In our proposed 
model we had a plan to use of agent based model to reduce the 
pricing. The nature of agent is dynamic and it behaves 
different according to circumstances. This is the reason agent 
has proven as an important entity for controlling the cloud 
environment. The paper [4] has exposed the aspect of agent 
usage in service configuration. The author presented 
architecture of Cloud which offers functionalities integrated 
with a mobile agent platform able to dynamically add and 
configure services on the virtual clusters. Paper [5] explains 
the value models using the agent which is used for qualitative 
and quantitative purpose. The author proposes a methodology 
to model the agents for interaction of service and business 
process. 

II. PROBLEM DEFINITION 



A service provider in a distributed environment like cloud 
generally has more than one server for offering the services to 
their customer. Initially there must be a service level 
agreement (SLA) between the customer and the service 
provider and SLA denotes the typical qualification of the 
services as requested by the customer e.g. maximum cost, 
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average response time etc. Service provider can offer same 
service from different server having different cost. Here the 
cost signifies the cost needed for providing the services by the 
provider. The cost of the same service offered from different 
server may vary based on the quality of the offered service. In 
this scenario, the challenge of the service provider is to choose 
the atomic services through which a composed service can be 
rendered to the user satisfying SLA with a minimum cost. 
Here an attempt has been made to propose a solution on behalf 
of a cloud for facing the above mentioned challenge. An 
agent based approach is used for proposing a methodology 
that can be used by a cloud. 

For the sake of simplicity, here we assume an arbitrary weight 
for these two factors to calculate the cost function. But in 
future work the AI technique can be used to make the agent as 
an intelligent one about the behavior of the cloud such that the 
agent can predict the weight of the variables involved in the 
cost function. This intelligent technique will be beneficiary for 
a realistic implementation. 



III. PROPOSED SOLUTION 



In this section, we have proposed a solution for the above 
stated problem using the agent- based technology. Two types 
of agent are used here: a Stationary Agent and a Mobile 
Agent. Agents endowed with the property of mobility are 
called mobile agents. The Stationary agent is immobile and 
only takes the decision based on the information provided by 
the mobile agent. But the Mobile Agent travels the whole 
system to collect information of system statistics needed to do 
the task for the user. Finally the stationary agent takes the 
decision through the interaction with the mobile agent. 
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Figurel. Proposed Model Representation 



The Proposed Methodology: 

The mechanism is described based on the proposed framework 
as depicted in Figurel. The steps of the methodology are as 
follows: 

Step I: The Cloud provider accepts a request from the clients 
with some typical requirements in form of Service Level 
Agreements (SLA). Here we assume that the SLA is identified 
by the metrics as: Cost range, Response time, CPU load, 
Security etc. 

Step II: After receiving the request the Cloud generates a 
Stationary Agent. The Stationary Agent receives the request of 
the Cloud Consumer. This Stationary Agent is responsible to 
create a Mobile Agent for every request with specified SLA. 
Once the Mobile Agent is created then it specifies the service 
to the Mobile Agent. 

Step III: The Mobile Agent moves on each of the server 
available to provide services and checks whether the service 
quality for the requested service matches with the requested 
parameters or not. If it matches, then the Mobile Agent 
identifies the details of the identified service, the 
corresponding server, the static cost for the requested service, 
the load of the server at that time instant and the traffic 
density of the lines lead to the server. This details is returned 
to the Stationary Agent .The later mentioned parameters are 
important to take the decision that sometimes it may not be 
beneficiary i.e. cost effective for offering a service from the 
server with high load and the higher chance of network 
congestion in spite of getting a cheap solution in terms of 
static cost of that service. 

Step IV: The Stationary agent now checks the list of the 
Server which can offer the requested services. After getting the 
list it checks the corresponding servers from where the service 
can be fired and the parameters required for cost calculation. 
Here we are assuming the variables used in cost function have 
the same weight. 

Step V: The cloud now starts providing the service as per the 
newly generated schedule. 



The interactions in a distributed system are often achieved 
using communication protocols. These protocols involve 
transfer of large volumes of data stored at remote hosts over 
the network to a central processing site resulting in high 
network traffic. An alternative to using communication 
protocols is the use of mobile agents. Mobile agents are 
dispatched to the remote hosts containing the needed data. 

The agents perform the computations at the remote hosts and 
return back with the results only. Thus the communication 
cost is reduced drastically as the computations are executed at 



Dept of CSE, PESIT, Bangalore South Campus 



96 



Proceedings of RETCOMP - 2013, Volume 1 



the data storage location instead of moving data to the 
computing location. 

The task requested by a customer may be considered as a 
composition of some atomic services and the cloud is 
responsible for offering those services. Sometimes it may 
happen that the closest server (with respect to the customer) is 
able to provide the requested services but there may be a 
mismatching between the qualities. As a result there is no 
other alternative except taking the service from another server. 
There must be some static cost for each service offered from 
different server with different quality. Moreover the load on 
that server at that time instant and the network traffic leads to 
that server should be considered for cost calculation. The 
cost function should be an expression involving the static cost 
as well as the cost of communication, (as it increases based on 
the traffic density) cost generated due to the overloading on 
that server. Thus the Cost incurred for offering the requested 
service =Static cost needed for offering the chosen services + 
x (Load factor of that server) + y (Chance of congestion). Here 
we are assuming that chosen services from different server 
after composition will be able to delivery the requested 



The agent based technique is used here to trap the cost 
value from each server and take the decision 
accordingly for choosing the minimum. Here the adaptive 
nature of the agents are exploited fully as you may get the 
different solution in various time generated by an agent based 
on same SLA. 
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Activity Diagram of Agents: 

The activities of the proposed agents are described below 
through AUML design as defined in UML2.0 [7]. 



Figure 2: Stationary Agent 



Stationary Agent: 

This agent is created at the beginning and creates mobile 
agent. Once the mobile agents collect the information, this 
agent is responsible for decision making. The AUML diagram 
of Stationary agent is as given in Figure 2. 



Mobile Agent: 

Mobile agent is created by stationary agent. The mobile agent 
moves in every server and collects the status information of 
each server at every time instant. The collected information 
then sends to the Stationary agent. The AUML diagram of 
Mobile agent will be as in Figure 3. 
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Figure 3: Mobile Agent Activity Diagram 



IV. CONCLUSION 



In this work, we have tried to propose a solution for the 
efficient service provisioning from the service provider point 
of view. We have tried to propose a solution through a multi- 
agent technique and the entire concept is implemented through 
agent-based programming. Here the adaptive nature of the 
agents can be exploited. It is because the different solution is 
generated based on the other cost factors like CPU overload 
and network congestion which changes time to time. Thus the 
cost value also changes accordingly due to the different values 
of the participating variables but with same SLA. For the sake 
of simplicity, in experimental purpose, here we assume an 
arbitrary weight for these two factors to calculate the cost 
function. But in future work the AI based technique can be 
used to make the agent as an intelligent one to apprehend the 
behavior of the cloud such that the agent can predict the 
weight of the variables involved in the cost function. This 
intelligent technique will be beneficiary for a realistic 
implementation. 



5. Hoa Khanh Dam and Aditya Ghose.: An agent- 
oriented approach to service analysis and design. 
International Workshop on Services and Agents 
(ServAgents 2010) held at PRIMA 2010, Kolkata, 
India,November 2010. 

6. M. Luck. , McBurney P. and Preist. C and the 
AgentLink Community, Agent Technology: Enabling 
Next Generation Computing, Agent Technology, a 
roadmap. page 94- 1: 94, Agentlink @2003 

7. M. Kang*, L. Wang and K. Taguchi: Modeling 
Mobile Agent Applications in UML2.0 Activity 
Diagrams, School of Computing, Leeds Metropolitan 
University*, Department of Computing, School of 
Informatics, University of Bradford. Edinburgh, 
Scotland, 2004, p. 104-1 11 "Third International 
Workshop on Software Engineering for Large-Scale 
Multi- Agent Systems (SELMAS'04)"@2004 



Dept of CSE, PESIT, Bangalore South Campus 



98 



